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REMARKS 

Claims 1 to 3 and 6 to 17 are pending. Claims 18 and 19 have been added. Thus, 
claims 1 to 3 and 6 to 19 are presently under examination. 

Regarding the claim amendments and new claims 

Claims 8 and 16 have been amended to indicate that the recited library of diverse 
molecules is a library of diverse peptides or peptidomimetics. The amendment to claims 8 and 
16 is supported throughout the specification as originally filed, for example, at page 6, lines 
24-3 1 , which indicates that a molecule can be a peptide including a variant or modified peptide 
or pep tide-like molecule such as a peptidomimetic. 

New claims 18 and 19 have been added. New claim 18 is directed to a method of 
identifying a molecule that homes to a selected organ or tissue, where identification of the 
homing molecule includes mass spectrometry. New claim 1 8 is supported throughout the 
specification, for example, at page 8, lines 11-15, which indicates that a highly sensitive 
detection method such as mass spectrometry, either alone or in combination with a method such 
as gas chromatography, can be used to identify organ homing molecules in a sample of a selected 
organ or tissue. 

New claim 19 is directed to a method of identifying a molecule that homes to a 
selected organ or tissue by administering to a subject a library of diverse molecules other than a 
nucleic acid library; collecting a sample of the selected organ or tissue; identifying a molecule 
that homes to the selected organ or tissue; and individually administering to a subject the 
identified molecule which homes to the selected organ or tissue. New claim 19 is supported 
throughout the specification, for example, at page 36, lines 15-27, which discloses that, 
following identification of several brain-homing peptide motifs, phage displaying the 
predominant motifs were amplified individually, administered to mice, and recovered from brain 
and from a control organ, kidney. 
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As set forth above, the amendments and new claims are supported in the 
specification as originally filed and do not add new matter. Applicants therefore respectfully 
request that the Examiner enter the amendments and new claims. 

Regarding the Examiner interview 

Applicants appreciate the Examiner's consideration and discussion of the 
remaining issues in the case in the personal interview with Dr. Erkki Ruoslahti and Ms. Cathryn 
Campbell on April 20, 2004. The enablement rejection was discussed in view of a new Rule 132 
Declaration with new mass spectrometric data corroborating that the claimed methods can be 
practiced as set forth in the specification. The Examiner indicated the sufficiency of these 
corroborating data to remove the outstanding enablement rejection. The amendment of claims 8 
and 16 was further discussed in regard to the outstanding written description rejection. 

Regarding the terminal disclaimer 

As acknowledged on page 2 of the Office Action mailed November 26, 2003, a 
terminal disclaimer was filed on September 29, 2003, disclaiming the terminal portion of any 
patent granted on the subject application which would extend beyond the expiration date of U.S. 
Patent Nos. 5,622,699; 6,068,829; 6,296,832; or 6,306,365. Applicant appreciates the 
Examiner's acknowledgement that the terminal disclaimer has been reviewed and accepted and, 
further, has been recorded. 

Regarding the written description rejection 

The rejection of claims 8 and 16 under 35 U.S.C. § 1 12, first paragraph, as 
allegedly lacking written description, is respectfully traversed. 

Claims 8 and 16 are directed to methods practiced with "a library of diverse 
peptides and peptidomimetics." Although the Examiner acknowledges that the specification 
supports that the molecules of a library can be "peptides or peptide-like molecules," it is alleged 
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that written description is provided in the specification for a library which includes a mixture of 
peptides and peptidomimetics. 

Applicants maintain that the specification provides written description for the 
libraries of claims 8 and 16, which are made up of diverse "peptides and peptidomimetics." 
Firstly, the specification indicates that a library is a "collection of molecules" and does not 
indicate that the molecules must be of a uniform type. Secondly, as is explicitly set forth in the 
specification as originally filed, the term "peptide" is used in the subject application to include 
peptides, proteins, peptidomimetics and the like. Thus, in view of the lexicography of the 
subject application, a library of "diverse peptides" can include both peptides and 
peptidomimetics. Although Applicants maintain that it is clear to the skilled person that 
Applicants were in possession of the claimed invention including the invention as it would be 
practiced with a diverse library of peptides and peptidomimetics, claims 8 and 16 have been 
amended herein in order to further prosecution of the subject application without prejudice to 
Applicants' pursuing the subject matter of the original claims in a continuation application 
claiming the benefit of priority of the subject application. 

In view of the above remarks and amendments, the Examiner is respectfully 
requested to reconsider and remove the rejection of claims 8 and 16 under 35 U.S.C. § 1 12, first 
paragraph, as allegedly lacking written description. 

Regarding the enablement rejection 

The rejection of claims 1 to 3 and 6 to 17 under 35 U.S.C. § 112, first paragraph, 
as allegedly lacking enablement for the full scope of the invention, is respectfully traversed. 
Each of the grounds for rejecting the claims as allegedly lacking enablement is addressed in turn 
below. 
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Regarding the use of recovered molecules 

The previous Office Action indicates that the specification teaches but a single 
use for a molecule that homes to a selected organ: identification of the library member so that the 
homing molecule can be produced and used as a targeting ligand. In asserting that only 
identified end products have utility, the Office Action takes the position that the claims must be 
enabled for the further step of identifying individual library members. The Office Action 
acknowledges that the specification discloses several means (tags) for recovery of a homing 
molecule, including a physical support such as a microbead or phage, and biotin. However, it is 
alleged that the specification teaches but a single type of tag for identification of an individual 
library member, a nucleic acid molecule. 

In parent application 09/227,906, Applicants have argued, and the Examiner has 
acknowledged, that one would have been able to, use recovered homing molecules without 
knowing their structure. Applicants herein make of record in the present case the arguments 
presented in the response to Office Action filed April 6, 2000, in parent application Serial No. 
09/227/906. Specifically, Applicants maintain that identification of a recovered homing 
molecule is not necessary to practice the invention of claims 1 to 3 and 6 to 9. Applicants 
respectfully disagree that the only utility for the claimed invention is as an intermediate in a 
process for identifying the homing molecule. 

Specifically, a recovered "molecule that homes to a selected organ or tissue" is 
itself clearly useful as a binding reagent that can be linked, for example, to a detectable moiety 
such as a radiolabel for diagnostic purposes. In the same manner that a monoclonal antibody is 
valuable even when its structure has not been determined, a recovered homing molecule of the 
invention can be used as a specific binding agent even without knowledge of the structure of the 
molecule . Given that the organ homing specificity of the recovered molecule, for example, 
brain-homing specificity or kidney-homing specificity will be known, one skilled in the art 
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would have had sufficient knowledge of the homing molecule to use it as a binding reagent, for 
example, to identify an organ sample. 

In sum, Applicants maintain that a recovered "molecule that homes to said 
selected organ or tissue" has utility as a binding reagent with a known specificity, even without 
"identification" of the molecule or knowledge of its structure. In view of the above remarks, 
Applicants respectfully request that the Examiner remove this ground for rejecting claims 1 to 3 
and 6 to 17 under 35 U.S.C. § 1 12, first paragraph, as allegedly lacking enablement. 

Regarding selection of starting libraries 

As a further ground for rejecting the claims as allegedly lacking enablement, the 
Office Action asserts that the specification does not teach any and all possible libraries of 
organic compounds nor which libraries would be likely to yield homing molecules. Applicants 
have previously argued in parent application 09/227,906, and the Examiner has acknowledged, 
that one of skill in the art would have been able to prepare various types of libraries. These 
arguments and evidence are made of record below. 

There are two issues which support this ground for rejection. Firstly, the Office 
Action indicates that the specification does not teach how to make any and all libraries of organic 
molecules, although it is acknowledged that techniques for making peptide libraries as well as 
certain peptidomimetic libraries were known in the art. In this regard, the Office Action alleges 
that well-known techniques do not extend to any and all possible libraries of organic compounds, 
or to any and all possible libraries of peptidomimetics. Secondly, the specification allegedly 
does not suggest which of all possible organic molecule libraries other than polypeptides and 
peptidomimetics would be likely to yield at least one homing molecule, and further asserts that 
this alleged deficiency is not remedied by what was well known in the art. 
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1. Construction of any and all possible libraries 

As discussed further below, it is not necessary that the specification teach "any 
and all possible libraries" suitable for practicing the claimed invention. Rather, an invention may 
be enabled in view of what is well known in the art in combination with the teachings of the 
specification. In this regard, the claimed methods can be practiced without undue 
experimentation using, for example, non-peptide libraries including polymeric and 
non-polymeric libraries such as organic chemical libraries as set forth in the specification. Such 
a chemical library can be, for example, a library of natural product molecules, or a library of 
synthetic molecules such as a decorated monomer library or oligomer library. As evidence that 
non-peptide libraries such as chemical libraries were well known in the art at the time the 
invention was made, Applicants submit herewith Exhibit 1 by Ecker and Crooke, Biotechnology 
13:351-360 (1995), a review article indicating that a variety of chemical libraries were available 
for drug discovery in 1995. In addition, Gordon et al., J. Medicinal Chemistry 37: 1385-1401 
(1994), attached as Exhibit 2, describe construction of several nonpolymeric small molecule 
libraries including a benzodiazepine library and a hydandoin library (see pages 1390-1392). As 
further evidence that one skilled in the art would have been able to prepare organic chemical 
libraries without undue experimentation, Applicants submit as Exhibit 3 a publication by 
Burbaum et al., Proc. Natl. Acad. Sci., USA 92:6027-6031 (1995). Burbaum et al. produce two 
libraries, a dihydrobenzopyran library and an acylpiperidine library, to be screened for carbonic 
anhydrase-binding activity (page 6028, first column, last paragraph). The authors explicitly 
comment that "very large combinatorial libraries of small molecules on solid supports can now 
be synthesized" (see abstract). In sum, these publications corroborate that a variety of 
non-peptide organic chemical libraries, including non-polymeric chemical libraries, were known 
in the art at the time the priority application was filed. 

Furthermore, Applicants maintain that it is not required to disclose or provide 
evidence showing availability of "all possible organic chemical libraries." Rather, all that is 
required is that sufficient teaching be present in the specification such that, combined with what 
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is well known in the art, the invention can be practiced without undue experimentation. As 
indicated by the Examiner in the telephonic interview of August 11, 2000, pertaining to parent 
application Serial No. 09/227/906, the in vivo panning invention is a generic invention. As 
further acknowledged by the Examiner, no inventor can be expected to describe or be aware of 
all species of libraries encompassed by the claims . 

In sum, Applicants have provided evidence of the availability of a variety of 
non-peptide organic chemical libraries including non-polymeric organic chemical libraries. 
Furthermore, in parent application Serial No. 09/227/906, the same issue was overcome. See, for 
example, Paper #10, Office Action dated December 14, 2000, which indicates that "Applicant's 
arguments concerning the breadth of the 'diverse molecules'. . .are convincing" (page 5, second 
sentence). In view of what is taught in the specification and the corroborating evidence 
presented herein exemplifying that a variety of libraries were known in the art, Applicants 
respectfully request that this ground for rejection be removed. 

2. Selection of libraries which will yield homing molecules 

Applicants would make several additional points in response to the assertion that, 
since the claimed invention borders on a pioneering invention, it cannot have been routine in the 
art to make and test various types of libraries to determine the identity of types of libraries that 
would provide at least one member that would home to a selected organ or tissue. 

a. Firstly, in the methods of the invention, libraries of diverse molecules are screened based 
on the general expectation, borne out by practice as described further below, that a random 
library which is sufficiently large will contain at least one molecule that homes to the selected 
organ or tissue. Again, this expectation has been borne out by practice. In this regard, 
Applicants point to the Rule 132 Declaration executed by Dr. Erkki Ruoslahti as corroboration 
that one skilled in the art would have been able to identify homing molecules from a random 
library of diverse molecules without undue experimentation. In particular, the attached 
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Declaration indicates that several homing molecules were identified from a library of 75 small 
organic molecules randomly selected from a 420,000 molecule ChemBridge library. 

b. Secondly, Applicants submit that it would have been well within the level of one of skill 
in the art to mix several drug or organic chemical libraries before administering the "mixed" 
library to a subject, as specified in step (a) of the present methods. Mixing of libraries is taught 
in the specification, for example, in Example II, which discloses in vivo panning in mice where 
brain is the selected organ. As disclosed in the specification, 

Mice were injected with two different mixtures of phage libraries. 
The first mixture contained libraries encoding CX9 (SEQ ID NO: 

39), CX5C (SEQ ID NO: 36), CX 6 C (SEQ ID NO: 37) and CX7C 

(SEQ ID NO: 38) peptides (CX5.7/CX9 mixture; SEQ ID NOS: 

36-39). The second mixture contained libraries encoding 
X 2 CX 14 CX 2 (SEQ ID NO: 40) and X 2 CX 18 (SEQ ID NO: 41) 

peptides (X2CX!8/X2CXi4CX2 mixture; SEQ ID NOS: 40 and 

41). The phage library mixtures were administered to mice via tail 
vein injection. 

Specification at page 33, lines 16-26 

Applicants submit to the Examiner that it would not have been beyond the level of skill of one of 
average skill in the art to mix non-peptide libraries in the same manner as phage-displayed 
peptide libraries were mixed in Example II. Such mixing of libraries in order to create a more 
complex library to be administered to a subject obviates concerns regarding selecting one 
particular non-peptidic library to be used in practicing the claimed invention. 

c. Thirdly, focused libraries designed around a "lead compound" for optimization of binding 
and other characteristics were well known in the art at the time the invention was made. Thus, 
rather than administering a random library to a subject without relying on any information 
regarding molecules known to selectively home to the selected organ or tissue, one skilled in the 
art would have been able to use a library of diverse molecules focused around a known homing 
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molecule or a molecule known to bind to a target receptor expressed in the selected organ or 
tissue. Where the known homing or binding molecule is a peptide, such a focused library can be 
a library of diverse non-peptide molecules or a library of diverse peptide molecules. The use of 
such smaller, focused libraries was well within the level of the ordinary artisan at the time the 
invention was made. As stated above, one skilled in the art would have been able to prepare or 
select focused non-peptidic libraries, if desired, based on the structure of a known homing 
peptide or a peptide known to bind a target receptor expressed in the organ of interest. 

d. Fourthly, even if, for the sake of argument, one skilled in the art were to practice steps (a) 
and (b) with a particular library as set forth in claim 1 and were not to recover or identify a 
homing molecule, such would not preclude enablement. Rather, if the molecules recovered from 
a particular library did not contain a homing molecule, the performed procedure would not be 
encompassed by claim 1 . Similarly, if one skilled in the art were to practice steps (a) and (b) 
with a particular library as set forth in claim 10 and werb to fail to identify a homingmolecule, 
such would not preclude enablement. Instead, such a procedure simply would not be 
encompassed by claim 10. In this regard, it is well established that claims are permitted to 
embrace one or more inactive embodiments. 

In view of the evidence made of record above and further in view of the 
corroborating evidence presented in the attached Rule 132 Declaration, Applicants submit that 
undue experimentation would not have been required to select a library of diverse molecules 
appropriate for identifying a homing molecule using in vivo panning. Applicants would also 
emphasize that the issues of whether libraries would have been prepared by routine methods and 
whether libraries would yield homing molecules is the same for tagged and untagged libraries. 
Applicants maintain that one of skill in the art would have been able to select from a variety of 
libraries a library to be administered in the methods of the invention. In view of the above 
remarks, and further in view of the corroborating evidence discussed with the Examiner in the 
recent interview and submitted herein, Applicants respectfully request that the Examiner remove 
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this ground for rejecting the claims as allegedly lacking enablement under the first paragraph of 
35U.S.C. § 112. 

Regarding the Rule 132 Declaration of record 

* 

Applicants have previously submitted a Rule 1 32 Declaration showing in vivo 
panning results obtained with a 10-compound library. The Examiner believes that the 
Declaration describes a pre-defined library of 10 different benzodiazepines, and asserts that the 
specification does not teach using pre-defined libraries. Applicants wish to clarify that the 
library of 10 compounds includes multiple benzodiazepines, but also contains compounds with 
different structures. Thus, the work described in the Declaration does not rely on a "pre-defined 
benzodiazepine library." 

The Office Action further queries the multiple peaks observed for a single 
compound, and the inaccuracy of the molecular weights observed. In this regard. Applicants 
submit that the suboptimal quality of the starting library compounds used is consistent with 
additional peaks observed. Furthermore, the molecular weight inaccuracies observed likely 
resulted from mass spectrometric techniques which were not optimal for the size of the library 
components being analyzed. In particular, the spectrometer used in this experiment was 
calibrated for protein, rather than small molecule, detection. 

Finally, the Examiner notes that Figures 1 and 6 of the previously submitted 
Declaration are both from brain extracts yet reveal distinct spectra. Applicants submit that both 
Figures 1 and 6 show mass spectrometric results of brain extracts from mice injected with the 10- 
compound library. The distinct appearance of the two spectra is explained by the fact that each 
figure shows different portions of the spectra, with Figure 1 showing m/z from 268 to 310 and 
Figure 6 showing m/z from 220 to 250. 

Applicants maintain that the previously submitted Rule 132 Declaration supports 
that the invention can be practiced as claimed with untagged small molecule libraries. In 
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addition, the teachings of the specification have been further corroborated using a random library 
of 75 commercially prepared small molecules, as evidenced in the Rule 132 Declaration 
submitted herewith. 

Regarding enablement of untagged libraries of peptides 

Applicants have previously argued that one skilled in the art would have been 
able to identify peptides from untagged libraries and have provided Clauser et al. as evidencing 
that one of skill in the art would have been able to use, for example, HPLC and mass 
spectrometry to identify untagged peptides from a peptide library. For convenience, a copy of 
the previously exhibited publication by Clauser et al. is attached hereto as Exhibit 4. The Office 
Action responds to Applicants' arguments by asserting that the specification does not teach such 
methods or how to use them in identifying homing peptides from a library. 

First, Applicants submit that, in view of what was well known in the art at the 
time the invention was made, the specification provides sufficient guidance to practice the 
claimed invention using untagged peptides. In particular, the specification teaches that an organ 
extract can be separated based on molecular weights or polar or nonpolar characteristics or the 
like using a method such as high performance liquid chromatography (HPLC; page 8, lines 21- 
24). The specification further teaches that a highly sensitive detection method such as mass 
spectrometry can be used to identify organ homing molecules (page 8, lines 11-15). Thus, the 
specification provides guidance to the skilled person, teaching that molecules can be separated, 
for example, based on molecular weight or polar or nonpolar characteristics and subsequently 
identified using a technique such as mass spectrometry. 

The use of mass spectrometry to identify peptides is corroborated by the attached 
publication by Clauser et al., which states that mass spectrometry techniques "enable routine 
peptide sequencing from sample quantities of 100 ftnol to 10 pmol," citing references 1 1 and 12 
which were published in 1993 and 1994 (Exhibit 4 at page 5072, second column, first complete 
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paragraph). From the above, it is clear that in 1993 and 1994, in advance of the priority date of 
the present invention, one skilled in the art would routinely have been able to separate peptides 
by mass or charge and subsequently determine their peptide sequence using, for example, mass 
spectrometric analyses. In sum, Applicants submit that, in combination with knowledge of 
techniques which were routine in the art at the time the invention was made, the guidance in the 
specification would have been sufficient for one skilled in the art to practice the invention 
without undue experimentation. 

In view of the above remarks, Applicants respectfully request that the Examiner 
reconsider and remove this ground for rejecting claims 1 to 3 and 6 to 7 as allegedly lacking 
enablement. 
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CONCLUSION 

Applicants respectfully request that the Examiner enter the amendments and 
consider the remarks herein above. The Examiner is invited to call the undersigned agent or 
Cathryn Campbell if there are any questions relating to this application. 



Respectfully submitted, 



Date: May 26. 2004 CLuxLujlX, •^cuLUXa^J 

Andrea L. Gashler 
Registration No. 41,029 
Telephone: (858) 535-9001 
Facsimile: (858)535-8949 

McDERMOTT, WILL & EMERY 
4370 La Jolla Village Drive 
Suite 700 

San Diego, California 92122 



/REVIEW EXHIBIT 1 

Combinatorial Drug Discovery: 

Which Methods Will Produce the Greatest Value/ 

David J. Ecker* and Stanley T. Crooke 

bis Pharmaceuticals, 2292 Faraday Avenue, Carlsbad, CA 92008. 'Corresponding author: (e-mail: decker@isisph.com). 

Combinatorial strategies are important new approaches to drug discovery, and it seems quite likely that 
.^SS fhTKvery of tateresting potential pharmaceuticals. However, it is less clear whether 
^SS^SSSSSSSSSvaS^ quantum advances in therapeutics. Nor is there general agreement 
SX f^ m^ ^^ in defining how combinatorial strategies will provide vah.e to the 
of S review ' we propose criteria that define the value of 

^StoXAt^^aitegor^the various approaches by: (a) the type of chemical space to be 
^SSStM^SS^V^to synthesize and screen libraries, and (c) the structures of individual 
Scules to Ubraies We evaluate the strengths and weaknesses of the various strategies and suggest 
milestones that can help to track their success. 

The two basic challenges of drug discovery are to iden- that combinatorial strategies are likely to result in the identifica- 
SyTtaS ^ compound with desired activity and to tion of increasing number of pharmacologically active lead 
ootimize the lead compound to meet the criteria suf- molecules. 
firieTto proceed wi«h development. Traditionally, Far less consensus, however has developed with regard to 
EficatiWa lead compound has been achieved the answers to two critical and related questions: Firs , what are 
rite £ random SreeTTof natural products or the factors that will determine whether combinatorial strateg.es 
£e7ofZfeti?Scal S , or by Modifications create value as defined by the discovery of novel compounds 
of chernicafs™* kn^Thysiological a tivities-usually the with unique therapeutic properties? Second, which strategy 
^S^fcr E%£* receptor. Advances in medicinal and library synthesis and testing tactics are most hkely to yteld 

^^"^ ^"-wers to these two questions wU! emerg^m 

StiTscSninTcaX 'Sa»d " Jfch usually leads to fester The objectives of this article are to suggest a basis by which the 

desired activities, specifications, pharmacokinetic properties, evaluation. 

- £SSS3S!rS3St- — * — «-* «ha. Define , the «.,ue o, Drug 

ous advances in the identification and understanding of molecu- ^ number of drug candidates is ^ 

^Sd variety of new synthesisandscreeningmethodsare easier parameter to judge. This issue further- reduces to two 
currently grouped under the term "combinatorial" These meth- 
ods include parallel chemical synthesis and testing of multiple 

individual compounds or compound mixtures in solution, syn- TABLE 1 . Criteria th at define value. _ 

thesis and testing of compounds on solid supports, and bio- improvements in the quantity of drug candidates 

chemical or organism-based synthesis of biologic^ oligomers * 

coupled to selection and amplification strategies. The various # Inhibitors of muitistep complex processes 

methods have proliferated rapidly; each with its putative advan- v Rapid m0V ement 0 f leads into the clinic 
tages, disadvantages, and proponents. • ^at^move to development daftly 

Numerous development stage companies have focused on ta ^£ the quality and performance of drug candidates 

various combinatorial approaches and many major pharmaceu- v Ma j or improvements in receptor subtype srxscificity 

tical companies have established combinatorial programs. V Activity against targets where classical approaches have tailed 

^J^^s have been chscussed and compared in several V In^rovement in phannacoloneucs 

reviews 2 9 , and most scientists involved in drug discovery agree 
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monomers, and post-oUgomerization modifications provide the 
fine-tuning. 

Specific biological interactions occur between molecules 
where at least one of the partners is an oligomer (e.g. , epineph- 
rine and its protein receptor). The most specific biological 
interactions usually involve two or more oligomers (e.g., tran- 
scription factor complexes). Information in biological oligomers 
is more than a text string-type of encoded data, such as the 
genetic sequence information stored in DNA. In structural 
RNAs and proteins, the specificity is engineered by the pattern 



benzodiazepines 

— hydantolns 

piperazinediones 

blphenyls 

— sugar analogs 

— p - mercapto ketones 

— arylacetic acids 

— acylpipertcJnes 

- benzopyrans 

- cubanes 

- xanthines 

- aminimides 

- oxazolones 



of the display of the diverse side chains in space relative to each 
other and the common backbone to which they are attached. 

Traditional Medicinal Chemistry— 
The Second Combinatorial Strategy 

During the twentieth century, natural product and medicinal 
chemists have provided enormous libraries of compounds. Tra- 
ditional drug discovery has been, in essence, a stepwise combi- 
natorial search through the chemical structure space available to 
the practitioners of drug discovery. A common approach taken 
to optimize compounds to meet specific therapeutic objectives 
has been to add functionalities to lead structures, a process 
which we call "monomer decoration." For example, consider 
the history of adrenergic drug discovery (Fig. 1). Nore- 
pinephrine and epinephrine were shown to be natural agonists 
for the adrenergic receptors. As the properties of the adrenergic 
system and receptors were determined, it became clear that 
antagonists might have therapeutic value and then that isotype 
selective agonists and antagonists would be of considerable 
interest. Thus, during the past 40-50 years, the basic catecho- 
lamine structure has been progressively modified to create vari- 
ous classes of adrenergic agents. Extraordinary therapeutic 
advances and pharmacological tools enabled the progressive 
advances in understanding the adrenergic systems. 

Although the term monomer decoration may sound simple, it 
represents a highly sophisticated set of advances in medicinal 
chemistry structure activity optimization including sidechain 
modification, isosteric replacement and structural constraining 
and stretching. It is clear that the strategies taken by medicinal 
chemists have differed dramatically from those taken by evolu- 
tion. Novel combinatorial strategies can be categorized asthose 
that facilitate the search of spaces traditionally searched by the 
medicinal chemists (i.e., approaches that facilitate monomer 
decoration) and those that facilitate the search of oligomenc 
space These approaches can be categorized further on the basis 
of synthesis, compound identification and screening tactics. 

Combinatorial Drug Discovery Motifs 

A broad variety of very different strategies and tactics tall 
under the category of "combinatorial." Each method has certain 
strengths and weaknesses and some methods are differently 
suited to specific purposes such as better understanding the 
origin of life, diagnostic testing kits, gene therapy and therapeu- 
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tic drugs. We will focus only on the issues relating to value in 
drug discovery. 

Combinatorial drug discovery motifs are difficult to classify, 
but can be organized in a three-level hierarchy (Fig. 2): the type 
of space being searched, tactics to synthesize and screen, and 
the structures of the individual molecules in the libraries. The 
space searched can be divided into two groups: monomeric and 
oligomeric (Fig. 3). Although the two chemical groups do 
overlap, the chemical space for each is quite different. 

Searching monomeric space. Decorated monomers are 
combinatorial extensions of classical medicinal chemistry. As in 
the development of adrenergic receptor drugs (Fig. 1), a basic 
scaffold with multiple sites for substitution is decorated with a 
variety of diverse functionalities. Placing the diverse functional- 
ities in different positions relative to each other on the common 
scaffold creates new shapes. 

The shape of the scaffold defines the general geometry of the 
space that can be searched. The molecular weight range of 
molecules in decorated monomer libraries will be determined by 
the choices of diverse functionalities that are attached. Gener- 
ally, the molecular weight ranges are fairly tight, since all the 
molecules are based on a common scaffold. 

Recendy, a significant number of scaffold structures have 
been decorated in a combinatorial fashion (Fig. 2). The earliest 
versions produced relatively small arrays (-40) that were syn- 
thesized individually in parallel based on the benzodiazepine 
structure, which was already well known to have acceptable 
drug properties 7 - 10 - 12 . Also developed early were arrays of hydan- 
toins and dipeptides 10 . As the technology improved, the numbers 
of compounds based on a single monomer scaffold increased to 
approximately 1000 and the compounds were synthesized in 
small mixtures. Recently a library of about 1000 piperazine- 
diones was synthesized 13 . Also reported were small arrays of 
antioxidants 14 , and a family of arylacetic acids 13 . Molecular 
scaffolds based on aminirnide and oxazolone chemistries have 
been used to make arrays of individual compounds 1617 . A series 
of decorated bis-benzamide phenols were described in which the 
scaffolds can be attached to each other 1819 . Families of acylpi- 
peridine and benzopyran compounds have been made on a 
support containing encoding binary tags 20 . Two different scaf- 
folds based on cubane and xanthine containing four acid chlo- 
ride functionalities were used to create a library of 
conformational^ restricted structures 2122 . 

All but the last two motifs were synthesized on a solid 
support. The range of organic reactions that can be successfully 
conducted on support-bound material is currendy being 
explored 13 - 15 - 23 . The cubane and xanthine libraries were synthe- 
sized in solution and tested in complex mixtures. It is noteworthy 
that the decorated monomer motifs eked above have been 
screened using the whole range of screening formats including 
individual molecule testing, partial release from solid support, 
binary tags and iterative subset decon volution. In all cases, 
compounds were tested in solution. — 

Enzymes can be used to modify biological monomers, either 
alone or in combination with chemical synthesis methods. A 
variety of modified simple sugars have been constructed using 
this strategy 24 - 26 . A strategy has been considered in which 
extremely complex mixtures of biological catalysts acting on 
themselves and on a basic set of monomers create super- 
critical reaction conditions that can generate billions of organic 
molecules 27 . 

Searching biological oligomeric space. Combinatorial 
drug leads can be obtained by using natural biological mono- 
mers and biological systems to assemble and sort them. In this 
approach, the principles of molecular evolution are applied to 
the discovery of drug lead structures 28 . Molecular diversity is , 
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obtained by oligomerizing a relatively small number of mono- 
mers in a large number of ways. A selection strategy is imposed 
so that the most fit molecules survive a challenge that results in 
their amplification. In drug lead discovery, the challenge could 
be binding to a desired target. An amplification step is intro- 
duced so that molecules that meet the challenge are amplified 
and can be recovered. A measured number of mutations may be 
introduced in the hope of creating even more fit molecules and 
the process is cycled through multiple rounds. 

Biological systems that have evolved to do this can be recon- 
stituted in vitro. Currendy this has been achieved with RNA 29 ^ 0 , 
DNA 41 ^ 5 and proteins 46 " 32 . The wealth of success by many 
research groups in the identification of biological oligomers that 
bind to a variety of targets with high specificity is testimony to 
the power that in vitro biological systems have to generate a large 
number of shapes and the screening and sorting capacity of the 
selection scheme 29 ' 33 - 34 - 38 - 42 ^ 33 -" The ability to use directed 
molecular evolution to extensively search the structure space of 
biological oligomers is the great advantage of this method. 

The difficult part comes after identification of the oligomer. 
Biological oligomers have limited value as drugs. They are not 
stable in biological systems, have poor pharmacokinetic proper- 
ties and generally do not cross biological membranes, which co- 
evolved with biological oligomers for the purpose of enclosing 
them. Since the whole purpose of drug therapy is to manage 
biology it is fundamentally difficult to override its homeostatic 
mechanisms with the same molecules that it already knows how 
to manage without making some chemical alterations. 

Several strategies can be employed to overcome the natural 
limitations of biological oligomers after they have been selected. 
Mimicking biologically interesting peptide structures with phar- 
maceutically acceptable molecules has been a strategy pursued 
by many groups over the past 20 years 61 -* 3 . RNA can be stabi- 
lized to a certain extent by chemical modification 64 - 70 . Generally, 
an RNA molecule with a desired activity is first identified and 
then modifications are systematically introduced to increase 
stability. However, any post-selection (back end) modification 
strategy runs the risk of losing the original activity as changes 
are made. 
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sembled, and allows the chemical manipulations to be auto- 
mated. The synthesis of libraries of peptoids 11 *- 115 illustrates this: 
the backbone portion of each unit is provided by coupling bro- 
moacetic acid to the N-terminus of the oligomer, and the diverse 
functional groups are introduced by nucleophilic displacement 
of the bromide by a primary amine. The resulting N-substituted 
glycine terminus is now ready for the next elongation cycle. 
PNA is another chemical motif amenable to the generation of 
diversity through a submonomer approach. In PNA, one nitro- 
gen serves as the point of attachment for the variable functional- 
ity while the other is used to link the components together. The 
latter position can serve as the point of attachment for additional 
functional groups. 

We are currendy applying the principles of both the mono- 
mer and the submonomer approaches for the preparation of 
phosphoramidate libraries (Hebert et al. , in preparation). In this 
application, a hydrogen phosphonate monomer is coupled to the 
growing end of the oligomer, and the intermediate is converted 
to a phosphoramidate. A wide variety of amines can be 
employed in the reaction, and a large number of monomers have 
been prepared and are compatible with the synthesis conditions. 
By incorporating variability both in the linkage and in the back- 
bone portion, a high density of functional groups can be 
achieved. Different backbone segments are expected to allow 
control over the presentation of the functional groups and create 
different shaped epitopes. 

Overlap between monomelic and oligomeric space. 
Monomer decoration and chemical ohgomerization are funda- 
mentally very different approaches to searching space, but the 
shapes created by each approach can overlap. Non-peptidic 
monomelic scaffolds that attempt to mimic the shapes of pep- 
tides 62 - 63 are examples of overlap between the spaces. In a sense, 
much of the effort of rational drug design is an attempt to 
decorate monomeric scaffolds in ways that mimic (or comple- 
ment) the shapes of biological oligomers. It will be interesting to 
see the results of experiments where unbiased libraries of chem- 
ical oligomers and decorated monomers are screened for activ- 
ity against the same targets. Will the solutions provided by each 
approach be similar or dramatically different? 



There is also opportunity for mixed mode searches. If two or 
more decorated monomers are linked together, the structure 
begins to cross over to a chemical oligomer motif with a non- 
repeating monomer unit. Likewise, small branched or cyclic 
oligomers may significantly overlap the same space as decorated 
monomers. 

Useful diversity in combinatorial libraries. There are two 
distinct considerations in combinatorial library diversity: the 
number of molecules made per synthetic effort, and how differ- 
ent the molecules in the library are from each other (Fig. 4). 
Classical medicinal chemists produced one molecule for each 
synthetic effort. Although a single molecule cannot be consid- 
ered "diverse;* the molecule could fell anywhere on the land- 
scape of all synthetically possible shapes. The first generation of 
decorated monomers, the diversomers 7 , created more molecules 
at one time, but the molecules were very similar to one another. 
Newer generations of decorated monomers search shape space 
in clusters. Decorated monomers search the space defined by the 
geometry of the scaffold. The variety of ways to decorate any 
given scaffold provides for a very thorough search of the clump 
of space surrounding each scaffold. 

Biological and chemical oligomers provide an even broader 
search of shape space than decorated monomers because the 
length of the oligomer can be varied to cover a greater variety of 
sizes within one library and there are more options for combina- 
tions. Synthetic chemical oligomers can have a higher diversity 
than synthetic biological oligomers for several reasons: more 
chemical monomers can be created, the "repeating unit" of 
chemical oligomers need not be homogeneous and synthetic 
submonomer diversity can be introduced in addition to mono- 
mer diversity. However, at this point the number of molecules 
that can be produced outstrips the capacity for screening. 

Biological methods search the greatest total numbers due to 
their powers of selection and amplification. These very powerful 
search strategies allow a very thorough search of all possible 
shapes that can be created by biological oligomers. However, the 
number of monomers is restricted and the repeating units are 
homogeneous, so the overall chemical diversity can be consid- 
ered to be lower than the chemical methods. 
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A distinct advantage of combinatorial chemistry compared 
with natural products is that the complexity as well as the 
composition of the mixture can be controlled. Choice of library 
complexity is an important factor in the design of libraries. A 
balance must be maintained between diversity and the concen- 
tration of each individual species. If libraries are too complex, 
the concentration of individual molecular species may be too 
low to measure a pharmacological response. Another consider- 
ation is that the split-synthesis method results in a unique com- 
pound synthesized on each bead of solid synthesis-support 124 , so 
for all possible sequences to be represented, the number of 
beads must significantly outnumber the number of individual 
compounds produced' 25 . Thus, although we have the ability to 
synthesize subsets containing millions of compounds, it is prac- 
tical to limit complexity to the order of tens of thousands of 
compounds per library. 

Screening Strategies for Chemical Libraries 

Combinatorial screening strategies (Fig. 5) are nearly as 
diverse as the synthetic strategies. One of the major innovations 
of combinatorial technology is the strategic linking of screening 
and synthesis. In the past, natural product and chemical files 
were simply screened in functional assays: enzyme inhibition 
assays, receptor binding assays, whole cell assays, or animal 
assays. The pharmaceutical industry has developed a vast capac- 
ity to screen compounds or compound mixtures in functional 
assays. Moreover, functional assays select for the desired activ- 
ity, while binding assays may select for compounds that bind the 
target protein, but fail to modulate its activity. 

The simplest screening strategy currently considered "com- 
binatorial" is the parallel chemical synthesis of many individual 
compounds, which subsequently are tested individually. Histor- 
ically, this strategy has been used to screen chemical compound 
files. The innovation is completely on the synthesis end, and 
synthesis and screening are not strategically linked beyond the 
ability to conduct structure activity studies more rapidly than in 
the past. The major advantage is that this strategy is compatible 
with all existing assays. The major disadvantage is that the 
search rate is relatively limited. When screening capacity 
becomes limiting, individually synthesized compounds can 
always be pooled together and tested as a mixture. However, 
high-throughput screening methods are usually able to handle 
the yield of parallel chemical synthesis methods, in which case it 
is probably not worth invoking the added complications associ- 
ated with screening mixtures, such as the need to deconvolute 
back to a single compound. t 

To begin to tap the true power of combinatorial methods in 
functional assays, compounds must be screened in mixtures. 
The related paradigm is natural product screening, which pro- 
duced new compounds from a complex extract using analytical 
fractionation, retesting, purification and structural elucidation. 
Combinatorial methods have a number of conceptual advantages 
over natural product screening that can improve both the quality 
of the active compounds and the facility with which they are 
discovered. The chemical composition of active natural product 
is beyond control and requires structural elucidation to deter- 
mine if it has any value. In many cases the structures cannot be 
synthesized or are too unstable to be useful drugs or cannot be 
analoged successfully. The composition of combinatorial 
libraries is under the complete control of the medicinal chemist 
and unstable or unsynthesizable compounds are excluded from 
the library from the beginning. While natural product extracts 
are not predisposed to revealing their structures, combinatorial 
libraries can be synthesized in a fashion that facilitates determin- 
ing the structure of an active compound in a mixture without 
analytical separation, purification or structural characterization. 



Strategic linking of synthesis and screening strategies in 
functional assays provides a means for determining the struc- 
tures of active leads using various deconvolution strategies such 
as iterative subset deconvolution 93 * 99 126 - 130 , position scan- 
ning ,3U32 , and partial release from solid support 100 *' 01 . All of the 
above methods can be used in functional assays because com- 
pound mixtures can be tested in solution. 

Screening mixtures of compound has a number of complica- 
tions and caveats and it is far from clear which strategy is best. 
The success of screening extremely complex mixtures of com- 
pounds from natural product extracts in functional assays sug- 
gests that screening complex mixtures from combinatorial 
libraries should also be successful. However, the complexities 
produced by having more than one compound in the screen that 
arise in natural product drug discovery will also be encountered 
in screening combinatorial libraries. For example, there is 
always the chance for the activity of an active compound to be 
missed because of an interfering compound in the same mixture. 
There is the problem of multiple active compounds contributing 
to the measured activity in the mixture, which results in disap- 
pointment when after deconvolution no single molecule has high 
enough activity to be of value. Multiple actives may also make it 
difficult to find the best molecule in the mixture 133 . These com- 
plexities have always existed in natural product screening, and 
whether they will be more or less a factor in synthetic combina- 
torial libraries remains to be determined. An argument can be 
made that in combinatorial libraries these problems may be 
worse because the molecules that comprise the mixture are more 
similar to each other than they are in natural product extracts. 
Although this may be true for some combinatorial libraries, it is 
very difficult to make this comparison. Natural products often 
contain hundreds or thousands of different types of secondary 
metabolites such as alkaloids or steroid derivatives, yet specifi- 
cally active representatives from each of these classes have been 
successfully detected and identified. 

As the name implies, combinatorial drug discovery is a 
numbers game. The more drug discovery screens that com- 
pounds can be put through, the higher the probability of finding 
a lead. Thus, it is intuitively obvious that combinatorial strate- 
gies that are compatible with the many assays already estab- 
lished in the pharmaceutical industry will have a major 
advantage over strategies that require special assays to be devel- 
oped for each target. Many man-years went into establishing, 
de-bugging and validating these assays by the industry, and 
compatibility with established assays is likely to be the most 
important issue relating to value from the standpoint of lead 
discovery. Lead optimization, on the other hand, is a process 
that should be more tolerant of an investment in assay develop- 
ment once it has been determined that there is an interesting 

lead. , 

Binding assays provide another useful screening tactic that 
has broad utility in drug discovery, diagnostic testing, DNA 
sequencing and basic biomedical research. Binding assays have 
special advantages such as the ability to control the relative 
concentrations of the target and compound libraries that avoid 
some of the pitfalls of functional inhibition assays. Binding 
assays can be divided into formats where the compounds are 
fixed on a solid phase or formats where the compounds start in 
solution and upon binding their target are captured on a solid 
phase or resolved through a chromatographic or mass-based 
separation technique. The solid support strategies usually have a 
built-in method to identify the most active compound such as 
addressable maricies 102 - 105 ' 134 * 135 or pins 108 . Other strategies pro- 
vide built-in handles to facilitate identification of the compound 
on a bead 101 * 124,136,137 . The recent dramatic improvements in mass 
spectrometry methods and micro-separation techniques have 
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greatly enhanced identification of compounds bound to a macro- 
molecule even in the absence of tagging 138 . 

Conclusions 

Combinatorial drug discovery methods have proliferated 
rapidly and there is a broad consensus that they will provide new 
leads for drug discovery. It is far less certain that combinatorial 
methods will provide compounds that represent major therapeu- 
tic advances. In this article, we have considered the issues in 
combinatorial drug discovery that are most important in creating 
value. 

The value of any drug discovery strategy can be evaluated by 
determining the quantity and quality of the drugs that it pro- 
duces. Improvements in the number of drug candidates is deter- 
mined by both how rapidly lead compounds are discovered and 
how quickly they can be moved into the clinic. Improvements in 
the quality of drug candidates relates to activity against novel 
targets, increased specificity, improved pharmacokinetics and 
lower toxicity. 

Whether combinatorial methods will be able to achieve these 
improvements will be determined by issues of chemistry, screen- 
ing strategies and target applications (Table 2). The most impor- 
tant issues concerning chemistry are the novelty and quality of 
the space searched as well as the extent of the space searched. 
Search strategies in combinatorial drug discovery can be divided 
into monomelic and oligomeric strategies. Monomeric strate- 
gies follow the example of classical medicinal chemistry and 
search the same space that has been searched historically in a 
much more thorough fashion. The oligomeric strategy follows 
the paradigm of biology and searches novel space with a higher 
level of diversity. Biological oligomers can be searched in the 
greatest sheer numbers because of the power of directed molecu- 
lar evolution as a selection strategy, but are severely disadvan- 
taged by the amount of chemical modification required to make 
pharmaceutically acceptable compounds. Chemical oligomers 
have the highest overall diversity potential, but need to be proven 
to be pharmaceutically acceptable. The relative value of the 



TABLE 2. Factors that determine value in combinatorial drug 
discovery methods. 



Chemistry 



Screening 



Target Application 



Novelty and quality of 
space searched 

• Ability to 
continually make 
new structure 
classes 

• Ability to control 
physical and 
chemical properties 
of space searched 

Extent of space 
searched 

• Rate of synthesis of 
new structures 

• Ability to bias a 
search around a 
preselected space 

Time required to 
advance a lead 
structure to a drug 
candidate 

• Minimal chemical 
modifications 

• Rapid optimization 

Cost of searching, 
optimizing leads, and 
scaling up drug 
candidate 



Breadth of assays 
amenable to 
combinatorial method 
• Compatibility with 
existing 

pharmaceutical 
screens 



Facility with which_ 

libraries can be 

screened 

• Minimum of 

hurdles or special 

methods 



Facility of identifi- 
cation of a single 
compound from a 
large library 



Major therapeutic 
need 



Role of molecular 
target in the disease 
process 



Compounds can 
achieve required 
specificity for target 
receptor 



monomeric and oligomeric strategies will be determined by the 
success of each in making discoveries. 

Combinatorially discovered decorated monomers will likely 
yield the earliest drug candidates, but will result in a significant 
advance in therapeutics only if they target novel pathways br 
offer increases in specificity. The chemical oligomeric strategy 
is based on proven principles of biological specificity and pro- 
vides the greatest opportunity to search away from previously 
searched space. 

The most important issues in screening strategies are the 
breadth of assays amenable to the combinatorial method, the 
facility with which combinatorial libraries can be screened and 
the facility with which the method can identify a single com- 
pound from a large number of compounds. In order to tap the 
true power of combinatorial strategies, it would be desirable for 
large numbers of structures to be sampled and the synthetic and 
screening strategies must be strategically linked. With respect to 
these issues, methods that allow screening in the vast array of 
existing functional pharmaceutical assays coupled to synthesis- 
based separations, should they prove compatible, may have the 
edge over other methods. Binding-based separations have high 
utility when the target is known and available in high purity, and 
when there is interest in optimization of an interesting lead. 

Individual companies involved in combinatorial drug discov- 
ery have limited resources and have to place their bets on the 
chemical strategies and screening methods that they judge will 
produce the greatest value. Ultimately, value will be determined 
in the laboratory and in the clinic as these methods are used to 
attempt to create new generations of drugs. 
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A. Combinatorial Organic Synthesis 

The notion of creating huge, searchable libraries of small 
organic molecules is unprecedented in medicinal chemistry, 
and the possibility of doing so has recently captured the 
imagination of the drug-discovery community. The 
conventional paradigm of small molecule lead develop- 
ment, in which a compound undergoes many rounds of 
individualized, hand-crafted modification and biological 
testing en route to drug candidacy, will likely be dramati- 
cally accelerated by the application of combinatorial 
chemistry technologies to mass-produce and evaluate lead 
analogs. The ability to harness molecular diversity 
techniques as tools for lead discovery offers an unparalleled 
opportunity for medicinal chemistry to expand the breadth 
and scope of molecular structures that may be screened 
for biological activity. Widespread availability of collec- 
tions of highly diverse small-molecule libraries should 
provide an opportunity to assess the impact of combina- 
torial organic synthesis on new-lead discovery. In this 
section of part 2, some of the issues which confront the 
practitioner of combinatorial organic synthesis, as they 
relate to the problems of molecular recognition in general 
and medicinal chemistry in particular, will be analyzed. 

Issues in Practicing Combinatorial Organic 
Synthesis 

Combinatorial organic synthesis (COS) presents some- 
what of an intellectual inversion of the paBt 50 years of 
synthetic organic chemistry. The chemist of the Wood- 
wardian era was interested in a masterful and carefully 
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plotted natural product synthesis of a complex entity of 
known structure. Reactions were more often employed 
or developed to Bolve specific challenges rather than to 
provide generic methodologies. Rigorous control of reac- 
tion pathways, stereochemistry, and regiochemistry, and 
the exclusion of all but the desired diastereomer were 
obligatory in a faithful rendering of the technique. In 
contradistinction to natural product total synthesis, rather 
than generating a single, specific entity, the goals of COS 
are to create populations of molecular structures. Rather 
than exercising complete control, the combinatorial chem- 
ist, while maintaining high reaction efficiency and relative 
reactive compatibility, may actually seek to create situ- 
ations and apply strategies in which stereochemical/ 
regiochemical control is relaxed. This must be achieved 
while remaining cognizant of the impact these factors may 
have on the stoichiometry of the resulting libary and its 
design and ultimate use. Hence, the combinatorial chemist 
seeks to apply a series of Woodwardian reactions (reliable, 
high yielding) that operate generically on a diverse set of 
building blocks to provide a multitude of related products. 

Criteria for Library Design, The primary objectives 
of producing small-molecule libraries by COS are to provide 
collections of compounds suitable for both drug-discovery 
screening and drug-development optimization. When 
complete, the combinatorial drug-discovery exercise should 
have created a stable population of low molecular weight 
entities, free of reactive and toxicity-causing functionality. 
While a paramount medicinal chemistry design criteria 
for small-molecule-libary construction might be that the 
products of diversity generation (individual library mem- 
bers) should "look" like drug leads, of still greater 
importance is that the library actually contains compounds 
capable of interacting at some detectable level with the 
biological target of interest. When small-molecule leads 
for a target have been previously defined (e.g., benzodi- 
azepine Uganda for a peptide or other G-protein-coupled 
receptor, transition-state inhibitors for a protease), the 
notion of searching for more potent derivatives among 
libraries combinatorial^ enriched in specific pharma- 
cophore analogs is an obvious tactic to pursue. However, 
as the universe of well-defined macromolecular drug targets 
continues to expand through the impact of molecular 
cloning, the problem of identifying new pharmacophores 
capable of modulating the various interactions of peptides, 
proteins, carbohydrates, oligonucleotides, or lipids at these 
sites will also be intensified. 

Will "rules" about the types of libraries that may prove 
generally useful in ligand discovery be discovered? Al- 
though the field of molecular diversity has not yet matured 
to the point where substantial insight into this question 
is forthcoming, it is intuitively obvious that small-molecule 
libraries, whose members structurally resemble historical 
leads, should provide a fertile reservoir of potential 
molecular diversity. Tangential to this, natural products 
aside, numerous historical drug leads were derived simply 
because synthetic routes to these molecules were readily 
available. It is likely that early-stage COS will be limited 
by applicable chemistry and that this will necessarily focus 
work toward traditional leads, whose syntheses are known 
and well-documented. 

The successful identification from recombinant libraries 
of L- amino acid-based peptide Uganda that inhibit protein- 
peptide, protein-protein, and protein-carbohydrate in- 
teractions suggests there is broad utility in screening large 



Ubraries of peptidic compounds. It remains to be seen 
whether coUections of other random molecular structures 
that are quantitatively as diverse as existing peptide 
libraries prove in de novo ligand discovery to include the 
"pharmacophores of the future". 

Ligand rigidity may be another important parameter to 
consider in the course of library design. The incorporation 
of conformational constraints into flexible lead molecules 
has emerged is a powerful strategy to enhance ligand 
potency and/or selectivity, particularly in the field of 
peptidomimetic medicinal chemistry. 2 - 10 Nevertheless, 
with regard to library design, conformational restriction 
may act as a two-edged sword: an inappropriate constraint 
is likely to abrogate the modest but perhaps detectable 
activity of a more flexible analog, which could, in a 
secondary library, be systematically constrained. From 
the point of view of random screening, it remains to be 
determined whether useful leads will arise more frequently 
from Ubraries of rigidified or flexible structures. Data 
from the evaluation of cyclic peptide Ubraries in both 
synthetic and recombinant systems may provide some 
important insights into this issue. A number of methods 
have been recently described for on-resin cyclization of 
peptides through both main-chain and side-chain func- 
tional groups. 6 - 10 * 68 At present, a portfolio of Ubraries 
containing both conformationally rigid and relaxed mo- 
lecular diversity seems most appropriate. A longer range 
solution might be to moderate the high risk of confor- 
mational restriction by creating very large populations of 
semirigid molecular arrays, comprising structural families 
that collectively sample as completely as possible all regions 
of conformational space. 

Characterization. The usual measures of evaluating 
success in organic synthesis may lose meaning in COS. 
The classical notions of such fundamental concepts as 
purity/homogeneity, yield, exact product structure, relative 
and absolute stereochemical control, specific physical 
properties are less relevant when applied to a broad 
population of molecules (of course they may become quite 
relevant as individuals emerge from a selection process). 
Additionally, the analytical mainstays of the synthetic 
organic chemist, such as NMR and IR, may become 
obviated. The NMR spectrum of a 10 000-component 
library mixture is not diagnostic. The loss of these 
powerful tools requires that compensating technologies 
be developed. A major dilemma of COS is the difficulty 
of confirming the degree to which the expected chemistry 
has proceeded on the entire population of substrate 
molecules. Several groups have recently reported on the 
use of electrospray mass spectrometry as a technique for 
evaluating the bulk composition of diverse peptide li- 
beries, 11 - 12 Gross synthetic discrepancies, such as incom- 
plete protecting group removal, may be detectable by mass 
analysis, providing an opportunity to optimize the library 
synthesis protocols. In the characterization of combina- 
torial products, the presence of "byproducts* (in COS, 
unexpected products), combined with the difficulty of 
detecting these compounds, wiU cause problems if one 
mistakenly concludes that a screening hit is the expected 
product. This section will conclude by offering a potential 
solution to this problem. 

Efficiency/ Automation. Among the chemical criteria 
relevant for small-molecule-library design is the efficiency 
of diversity creation. The assembly of most small mol- 
ecules reduces to the intercombination of only three to 
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five building blocks of molecular weight ~150 each. 
Synthetic reactions capable of combining numerous 
building blocks simultaneously constitute a highly efficient 
form of diversity generation. Thus the Ugi four-compo- 
nent reaction has a high combinatorial efficiency since 
building blocks of four families (amines, carbonyl com- 
pounds, isocyanides, and suitable acid components) are 
linked simultaneously to afford a-amino acid derivatives. 
In contrast, peptide chemistry traditionally links two 
building blocks at a time. In both the broad screening 
and the lead analoging modes, a longer range question 
pertains to the ability of the chemistry to eventually be 
automated. Once the key decisions and overall strategy 
have been determined, much of the actual chemistry is 
repetitive in nature. Machines will continue to be 
constructed to capitalize on this and libraries will be 
assembled under computer control. 13 " 16 

Quantity and Quality of Diversity. While the 
"quantity of diversity" that is experimentally accessible 
can be dictated by the number of building blocks in the 
basis set and by the number of synthetic operations 
applied, or able to be applied (see part l 1 ), the practical 
limitations on library size are most generally imposed by 
the format within which the diversity is created and 
evaluated. A small number of building blocks subjected 
to many synthetic steps will yield high (numerical) 
diversity; however the products of these reactions may be 
relatively large molecules, not well-suited for lead devel- 
opment as traditionally administered therapeutics. Thus, 
as the combinatorial process proceeds, an opportunity 
window may exist in which the bulk of the library possesses 
properties which standard medicinal chemistry usually 
seeks in small-molecule drug discovery (MW < 700, 
solubility, etc.). Continued application of the combina- 
torial process will lead to product libraries containing larger 
molecules (composed of more building blocks) wherein 
the individual library members have "outgrown" the 
classical criteria of a lead-drug molecule. 

In surveying the historical landscape of drug discovery, 
there are particular pharmacophores or structural arrays 
which periodically surface far in excess of random chance 
(benzodiazepines, ^-lactams, imidazoles, phenethylamines, 
etc.)/ A review of recent successes in the era of "rational 
drug design" suggests that certain molecular concaten- 
ations — protein turn mimetics, conformationally restricted 
amino acids, transition-state analogs, dipeptide isosteres, 
molecular scaffolds, designed elements for enzyme 
inhibition — are often f ound in the medicinal chemistry of 
lead compound development. In consideration of the 
molecular structures which have left their mark on modern 
medicinal chemistry, one might conclude that the drug- 
discovery process is impacted not only by the sheer 
quantity of diversity surveyed, but additionally by the 
more subjective "quality" of diversity that is evaluated. 
Different organizations and individuals will certainly bring 
a wide variety of criteria to the subjective appraisal process, 
depending on style, experience, and bias. It may be 
speculated that the quality of diversity will be influenced 
by the sophistication of the building blocks originally 



* An interesting aside regarding these important substructures is that 
development of "generic" syntheses of key pharmacophores ultimately 
enabled facile generation of many analogs. Concurrently or subsequently, 
diverse biological activities were found among these compound classes. 
In a sense, this Is suggestive of combinatorial chemistry, except the crucial 
molecules were made serially rather than in a parallel/combinatorial high 
throughput fashion. 



introduced into the combining system (library bias on the 
part of the medicinal chemist) and the extent to which 
molecular substructures of the building blocks can be 
assembled in diverse, spatial (3-D) relationships. Thus 
the collected expertise of medicinal chemical knowledge 
may be used to "hyperevolve" or "bias" the libary by the 
planned introduction of commonly evolved elements; these 
elements are "retrocombinatorial synthons" of many 
known bioactive classes. Thus the building block basis 
set must be judiciously chosen and carefully attuned to 
the collected knowledge historically amassed in drug 
discovery. 

Issues in the Selection of Building-Block Sets 

The acquisition of a building-block library can be a major 
time and resource investment, and the eventual decision 
of which type of chemical building blocks to utilize places 
limits on the universe of structural diversity which 
ultimately can be explored. Depending on the specific 
objective, important building-block criteria include the 
availability of a large number of diverse, fairly complex, 
easily accessible starting materials. These may be either 
commercially available or prepared in a few steps from 
commercial materials. Members of a building block set 
should reflect a broad array of physicochemical properties, 
functionality, charge, conformation, etc. Building blocks 
may be chiral, achiral, or racemic. Certain building-block 
families have what may be termed a high "combinatorial 
potential". This relates to the high density per carbon 
atom of reactive functionality which can participate in 
new covalent combinations. For example, monosaccha- 
rides have high combinatorial potential since the high 
density of available hydroxyl groups leads to many 
potential connecting permutations. In addition to polymer 
formation, the high combinatorial potential of such types 
of building blocks may also be exploited as scaffolds for 
the generation of diversity (vide infra). 

Synthetic Strategy 

An important strategic element in combinatorial library 
synthesis is the degree of reliability of the ligand synthesis 
chemistry. What is the likelihood of general synthetic 
success with a particular reaction? The nature of com- 
binatorial reactions, which must proceed in the face of a 
broad range of functionality on a multitude of substrates 
and where the products are difficult to analyze individually, 
demands that, in selection of synthetic methodologies, 
greater weight must be given to reaction sequences with 
reliable, predictable outcomes. A more subtle question 
revolves around the number of synthetic options available 
in the course of diversity generation. For example, a 
synthetic strategy structured in such a way that, as the 
process proceeded, new combinatorial possibilities opened 
up, would be preferable to having options narrow, espe- 
cially if the goal was generating a maximum of structural 
diversity. 

As previously noted, there are two distinct themes that 
must be considered for the successful application of 
combinatorial technologies to ligand discovery and opti- 
mization, viz. broad-based screening and directed chemical 
analoging. The issues underlying conceptual design, as 
well as the synthetic strategies utilized in construction of 
these different classes of libraries, are noteworthy and are 
summarized in Figure 1. Building block requirements for 
undertaking broad and narrow diversity searches differ 
markedly. The search for an initial lead molecule may be 
essentially a random screening exercise, where the em- 
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Figure 1. Combinatorial chemistry: comparison of two major 
themes. 



phasis is on exposing a macromolecular drug target to the 
maximum possible structural diversity. The objective is 
to identify a ligand of significant affinity for the target, 
the exact ligand structure and its detailed characteristics 
at this point are not relevant: in fact any molecule will do. 
An approach to generating highly diverse libraries for use 
in medicinal chemistry might favor using building blocks 
which have distinguished themselves by appearing fre- 
quently in previous active leads (e.g., statine, hydroxy- 
ethylamine8, Preidinger lactams 16 ). On the other hand, 
once a lead is available, most drug discovery proceeds 
through a series of evolutions (optimizations) in order to 
meet a set of predetermined criteria. Since specific 
structural types are sought, searching in a very broad pool 
of diversity (as above) is unlikely to be successful (actually 
it could uncover a new lead but is less likely to optimize 
an existing one). Ideally, what is required in this type of 
diversity-generating strategy is to "explode" around the 
known lead, i.e., to create as highly diverse population as 
possible that bears close structural resemblance to the 
original hit, followed by a selection for desired criteria. 

Clearly the subunits which lead to predetermined 
structures must be quite specific: from where should 
building blocks for known structural classes of pharma- 
cophores arise? The answer, as in organic synthesis, lies 
in a retrosynthetic analysis or what we may term a 
retrocombinatorial approach to building-block selection. 
Lead structures should be retrosynthetically dissected in 
the maximum number of ways and upon these various 
possibilities imposed the needs of performing combina- 
torial chemistry. Inspection of the retrosynthetic tree 
invites the following key questions: By which modes of 
forward synthesis are the most building blocks available 
or obtainable? If the synthesis is allowed to proceed by 
that course, what is the scope and degree of reliability of 
the necessary reactions? Extending this line of reasoning 
should permit the maximum leverage to be applied 
combinatorially. 

A common feature of both paradigms is likely to be a 
reliance on solid-phase-synthesis methods to facilitate the 
assembly of combinatorial libraries. Synthesis on a 
polymeric support greatly simplifies the problem of 
product isolation from reaction mixtures and also facili- 
tates the partitioning of products into multiple aliquots 
for subsequent chemical elaboration. Moreover, the 
opportunity exists to take advantage of the support- 
tethered diversity in the design of convenient receptor 
binding assays for library evaluation. While there has 
been a long tradition of polymer-supported organic 
chemistry, 17 - 20 it is only in the areas of peptide and 
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Figure 3. Synthesis of N-protected nitrophenyl carbonate 
monomers. Key: (a) BH 8 , THF; (b) DCC, methylene chloride, 
iV-hydroxysuccinimide, HOBt; then sodium borohydride, ethanol; 
(c) p-nitrophenyl chloroformate, methylene chloride, pyridine. 

oligonucleotide synthesis that solid-supported chemistry 
has truly been optimized and become common-place. The 
advent of combinatorial organic synthesis will undoubtedly 
signal a renaissance in solid-phase organic chemistry, as 
workers attempt to adapt well- characterized homogeneous 
reactions to reliable solid-supported protocols. 

Progress to Date: Synthetic Polymeric Diversity 

The design and synthesis of novel synthetic monomers 
which, when assembled in a combinatorial fashion, could 
yield relatively low molecular weight polymeric materials 
is an approach that is well-suited to diversity generation 
and evaluation. Combinations of such monomers could 
lead to substances with novel backbones, possibly pos- 
sessing desirable properties, such as metabolic stability, 
enhanced pharmacokinetic profiles, and cell and mem- 
brane permeability. Identification of these and other 
potentially modifiable parameters in such systems could 
facilitate drug discovery. 

Schultz and co-workers have reported the synthesis of 
a library of oligocarbamates starting from a basis set of 
chiralaminocarbonates 21 (Figure 2). The monomeric units 
were readily obtained by the modification of amino acids 
via the intermediacy of the corresponding chiral amino 
alcohols (see Figure 3). The resulting nitrophenyl car- 
bonate building blocks (5) were stable for several months 
at room temperature. 

Oligocarbamates were synthesized on a solid support 
by deprotection of a resin-bound amine, protected with 
either the base-labile Fmoc or photolabile nitroveratryl- 
oxycarbonyl (Nvoc) group, followed by treatment with a 
nitrophenyl carbonate of type 3. The deprotection/ 
coupling cycle was repeated until an oligocarbamate of 
the desired length was attained (seven or eight cycles). 
Overall coupling yields were greater than 99% per step. 
Side-chain deprotection followed by resin cleavage af- 
forded the desired oligocarbamates (Figure 4). 

The VLSIPS photolithographic chip format, previously 
employed for oligopeptide synthesis, was used in the 
construction a spatially-addressable oligocarbamate li- 
brary of 256 members. An anti-carbamate monoclonal 
antibody served as a model receptor for screening against 
this array. Antibody:oligocarbamate complexes were 
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Figure 4. Solid-phase synthesis of oligocarbamates. Key: (a) 
nitrophenyl carbonate monomer, HOBt, diisopropylethylainine, 
NMP; (b) piperidine, NMP or hv t (c) acetic anhydride, NMP; (d) 
TFA, triethylsilane. 
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Figure 5. Comparison of peptide and peptoid backbones. 

detected by treatment of the chip with a fluorescein- 
conjugated secondary antibody, followed by analysis using 
scanning epifluorescence microscopy. Because the location 
and structure of each different library member is defined 
by the synthetic strategy (binary masking) used in this 
technique, the necessity of sequencing the products is 
obviated. The binding activities of putative hits were 
confirmed by conventional assays using authentic material 
prepared by independent synthesis. A preliminary evalu- 
ation of the physicochemical properties of oligocarbamate 
molecules indicate that they are more hydrophobic than 
the corresponding peptide homology and their expected 
resistance to several proteases was confirmed. 

Another type of synthetic polymeric diversity has been 
developed by Simon et aJ. 32 * 23 Through a variety of 
preparative routes, this group created a basis set of 
monomeric N-substituted glycine units, each bearing a 
nitrogen substituentsimilarto those of the natural a-amino 
acid side chains. The formal polymerization of these 
monomers results in a class of polymeric diversity which 
these workers have termed "peptoids" (Figure 5). Peptoids 
may be synthesized either manually or robotically following 
either a "full monomer" oligomer synthesis or via a 
"submonomer" synthesis, as reported by Zuckermann et 
al. u and illustrated in Figure 6. Various biological 
activities have been established for specific peptoid 
sequences, including inhibition of a-amylase and the 
hepatitis A virus 3C protease, binding to the tat RNA of 
HIV 22 , and antagonism at the ai-adrenergic receptor. 26 
The peptoid approach to diversity generation has been 
extended to the preparation of encoded combinatorial 
libraries, in which natural amino acids code for the 
structure of the peptoid chain 26 (see part l 1 and Figure 7). 

An important variant of the synthetic polymeric di- 
versity approach is directed toward construction of a 
chemical library in which the peptidyl backbone is 
conserved but a dipeptide unit is replaced at specific 
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Figure 6. Synthesis of peptoids. 

positions by a phosphonate dipeptide surrogate (see Figure 
8). Such phosphonate pharmacophores are well-known 
as transition-state analogs for amide bond cleavage and 
have found wide usage in the inhibition of metallo- 
proteases. 27-29 Campbell has described methods for the 
solid-phase synthesis of peptidylphosphonates that are 
compatible with the Fmoc/tBu protecting group strategy 
of standard peptide synthesis. 30 The key reaction step is 
formation of the phosphonate ester bond, which is achieved 
via a modified Mitsunobu condensation (Figure 9). 
Precursor lactic acid and protected amino phosphonate 
building blocks are prepared as shown in Figure 10. 

When this process is applied to the combinatorial 
synthesis of peptidylphosphonates, the diversity product 
will be a metalloprotease enzyme inhibitor library. En- 
zyme-inhibitor libraries of this type and those focusing on 
other known inhibitory pharmacophores (thiols, hydrox- 
amates, carbozyalkyldipeptides, etc.) may prove to be 
important tools in rapidly profiling novel proteases and 
for determining which pharmacophores are most effective 
at their inhibition. Using this knowledge, secondary 
inhibitory libraries can be constructed to optimize original 
leads. Through such a process it may be possible to 
dramatically accelerate the process of finding high-affinity 
enzyme-inhibitor Uganda. 

Another interesting type of polymeric diversity based 
upon a vinylogous polypeptide backbone has recently been 
reported by Hagihara et af., 31 in which introduction of a 
trans olefinic linkage between the a-carbon and the 
carbonyl group of various amino acids is generalized. 
Additionally, Smith and colleagues have synthesized a non- 
amide polymer of (3,6)-linked pyrrolin-4-one oligomers 
which mimic the 0-strand conformation of a normal 
peptide chain 32 (see Figure 11). 
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Figure 7. The synthesis of an encoded library consists of the 
following steps: (1) A Afunctional linker containing two or- 
thogonally protected points of extension (Na-Fmoc-Nc-Moz-Lys- 
0H) is attached to polystyrene resin via an acid-labile linker. (2) 
The solid support is divided into n equal portions at a mixture 
position. (3) A unique Na-Fmooprotected. non-natural monomer 
(B) is coupled to the "binding" strand. (4) A series of Na-Ddz- 
protected L-amino acids (C) are then coupled to the ^coding* 
strand, (5) The solid supports are recombined. 
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Nonpolymeric, Small-Molecule Diversity, The ma- 
jority of chemical diversity generation discussed above 
concerns the preparation of linear molecules, in which the 
target structures are unambiguously specified by the order 
of building-block addition. In contrast, .the great pre- 
ponderance of organic synthesis proceeds rather differ- 
ently, wherein building blocks interlock to give rise to 
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Figure 12. Components of a benzodiazepine library. 

nonpolymeric* three-dimensional arrays. The recent 
seminal work of Ellman on the solid-phase synthesis of 
^-benzodiazepines lays the groundwork for creation of 
a small- molecule library of one of medicinal chemistry's 
most notable pharmacophores and represents one of the 
first examples of the application of combinatorial organic 
synthesis to nonpolymeric organic compounds. 33 

The benzod iazepines are synthesized on a solid support 
by the connection of three building blocks, each of different 
chemical families (Figure 12). Following the attachment 
of 2-aminobenzophenone hydroxy or carboxy derivatives 
to the support using an acid-cleavable linker, [(4-hy- 
droxymethyl)phenoxyacetic acid] , the N- protecting group 
is deblocked (piperidine/DMF), and the weakly nucleo- 
philic amine is acylated with an or-Fmoc-protected amino 
acid fluoride, using 4-methy!-2,r>di-ter*-butylpyridine as 
an acid scavenger (Figure 13h Fmoc deprotection, followed 
by treatment with 5% acetic acid in DMF, causes the 
general cyclization to the intermediate lactam. Capital- 
izing on the ability of lithiated 5-(phenylmethyl)-2- 
oxazolidinone to selectively deprotonate the anilide NH, 
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Figure 13, Combinatorial synthesis of the benstodiazepine pharmacophore. 
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Figure 14. Synthesis of 2*aminobemophtmone derivatives on a solid support. 



aikylatkm was achieved with a variety of alkylating agents. 
Aqueous acid cleaves the new benzodiazepine from the 
support in very high overall yields. The integrity of the 
chiral center was confirmed by a racemization test using 
chira) HPLC. 

One of the limiting features of applying the above scheme 
to combinatorial library construction is that, though many 
alkylating agents and amino acid building blocks are 
commercially available, there is not a ready supply of 
appropriately functionalized 2-aminobenzophenones. Ell- 
man addressed this problem directly by creating a general 
method for preparation of these materials on solid 
supports 34 (Figure 14). The stage is now set for the Ellman 
laboratory to create a benzodiazepine library. 

Several other approaches to nonpolymeric molecular 
diversity have recently been published In pursuit of small- 
molecule libraries , Nikolaiev etal. have used their amino 
acid encoding format (part I 1 ) with a building block basis 
set combining both amino acids and other synthetic units 



to prepare collections of nonpeptidic compounds and 
peptides refractory to Edman degradation (N-blocked 
peptides) > m Representative examples of molecules which 
have emerged from such non-peptide libraries are shown 
in Figure 15. 

A feature of several of the formats used in the display 
of synthetic diversity is that the potential Hgands are 
tethered to a solid support. While screening strategies 
have been developed to exploit this feature, it is frequently 
desirable to screen compounds in solution. Many groups 
have engaged in developing releasable linker strategies to 
solubiliase potential ligands. The issue has been addressed 
by a considerably different strategy by Hobbs De Witt et 
aL, in which solid-phase chemistry, organic synthesis, and 
a designed parallel reaction apparatus were utilized for 
the generation of small- molecule libraries, the individuals 
of which, were termed u di verso mers". 36 Target compounds 
which included dipeptides, hydantoins, and benzodi- 
azepines were synthesized simultaneously but separately, 
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Figure 15. Structure of representative molecules from the 
Nikolaiev et al, nonpeptide library (ref 35). 
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Figure 16. Synthesis of a hydantoin library. 

on a solid support in an array format, to generate a 
collection of up to 40 discrete structurally related com- 
pounds. The preparation of hydantoins was carried out 
as shown in Figure 16. The synthetic strategy is directed 
through a resin-tethered penultimate product, in which 
the orchestrated revealing of distal functionality initiates 
attack on the resin-linking bond to eject the newly formed 
product into solution. Products which fail to react, should 
remain attached to the solid phase, and thus aid in product 
purification. The yields of hydantoins released from the 
resins in the final step ranged from 4 to 81% on a scale 
of 0.3-11 mg, which should be sufficient to support most 
preliminary in vitro biological testing. The resulting 
soluble, small molecules were characterized by traditional 
means. The authors also note the utility of 18 C gel-phase 
NMR to monitor reaction progress of the resin-bound 
intermediates. 37 ' 38 

In a similar manner, a general method for multiple, 
simultaneous synthesis of soluble benzodiazepines was 
developed (Figure 17). Eight groupB of five-amino acid 
resins were trans-imidated with five groups of eight 
2-aminobenzophenone imines to form 40 resin-bound 
imines. Treatment with TFA liberated 40 discrete ben- 
zodiazepines from the resins. The products were obtained 
in 2-14- mg quantities, corresponding to 9-63 % yields with 
estimated purities of >90%. Though the numbers of 
compounds involved in the diversomer methodology ( ~40) 
are significantly smaller than that which can be prepared 
by other library strategies (lOMO®), this interesting 
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Figure 17. Preparation of a soluble benzodiazepine library. 

approach to parallel organic synthesis produces relatively 
pure materials on a preparative scale in the traditional, 
soluble format. 

Future Innovations 

As the field of combinatorial chemistry receives in- 
creasing attention from the pharmaceutical establishment, 
it seems likely that the contents of chemical libraries will 
continue to evolve to look more and more like the type of 
compounds which have previously led to drugs. In spite 
of the complexity which early parts of the process may 
pose to the combinatorial chemist, a hidden advantage 
which combinatorially-derived molecules offer is that any 
"hit" will be readily synthesizable, by definition. This 
should be contrasted with a natural product driven 
approach to drug discovery and development, where often 
the structural complexity of the lead compound hampers 
the rapid preparation of analog molecules and the 
acquisition of SAR. 

A previous point deserving further emphasis is that the 
vast universe of synthetic organic reactions are idiosyn- 
cratic transformations that fail to afford quantitative yields 
of unique products. Most synthetic chemistry procedures 
afford multiple products (regio- and stereoisomers, etc.) 
in variable yields. If diversity-generating chemistry 
proceeds ambiguously, how then are the results of small- 
molecule combinatorial organic syntheses to be understood 
and appropriate information extracted from library analy- 
sis? It may be speculated that encoding techniques will 
provide one method by which the combinatorial organic 
chemist can address the practical inefficiencies of chemical 
synthesis. Instead of envisioning an encoding tag as 
explicitly specifying the structure of an associated entity, 
one might consider the tag as o record of the chemical 
history of individual library members. Thus, after 
encoding the "recipe" or synthetic protocol used in the 
assembly of a combinatorial library, the library may be 
screened for active recipes. Once identified as "active", 
the synthesis would be replicated on a preparative scale, 
and the product mixture fractionated to identify active 
product(B). This strategy shifts emphasis from the 
criterion of singularity in a reaction outcome (a single 
predictable structure) to reproducibility and compatibility 
(orthogonality) with chemistry used in the synthesis of 
the encoding tag and in preparative scaling. The creation 
of encoded, small-molecule diversity, which can be released 
from a support (solubilized) while some type of link to the 
original tag is also maintained, is also likely to be an 
important area of investigation. 
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Figure 18. Scaffolds as templates for combinatorial chemistry. 

One type of noteworthy chemical Btrategy which may 
have a bright future in the combinatorial realm rests on 
the conceptual extension of the work of Hirschmann, 
Nicolaou, and Smith and their co-workers into designing 
nonpeptidal peptidomimetics by the appropriate func- 
tionalization of designed scaffoldings (Figure 18). A 
specific example of this approach is the design and 
synthesis of a 0-D-glucose-based nonpeptide mimetic of a 
potent cyclic hexapeptide somatostatin agonist. 89 ' 40 

Hirschmann and co-workers have also used a function- 
alized steroidal template to serve as a backbone for 
mimicking a type 1 0 turn. 41 * 42 The fact that such a 
remarkable job of molecular mimicry can be achieved with 
individual compounds bodes well for the application of 
this approach to combinatorial methods. 

No library will ever be "complete" but instead will sample 
a subsection of a particular universe of molecular structure 
and space. In certain situations, libraries may be con- 
sidered to be starting materials for the construction of 
new libraries of diversity. It is also useful to consider 
chemical libraries as collectable or archivable entities. 
Ideally, one seeks to preserve new compound libraries and 
use them for a variety of present and future screening 
needs. As time passes, the combinatorial chemist will be 
in possession of an accumulating collection of molecular 
diversity with which to challenge new drug targets. If 
chemical libraries are to become an item of commerce, a 
good deal more will have to be learned about their "shelf 
life* and how best to store them for future use. To date, 
the shelf life of chemical libraries is an open question. 

B. Methods for Screening Combinatorial 
Libraries 

The importance of distinguishing between the two 
principal applications of combinatorial technologies in 
ligand discovery, i.e., broad screening versus directed 
analoging, is particularly relevant to the design of assay 
methodogies for library evaluation. In searching large, 
highly diverse libraries for novel lead compounds, a 
premium is placed on the ability to detect rare ligands 
that may have modest affinity for the target receptor. The 
assay strategy may differ in screening analog libraries, 
since one is trying to develop quantitative SAH on a large 
number of compounds and to increase the potency of a 
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lead. Regardless of the application, successful use of 
combinatorial libraries is highly dependent on the sen- 
sitivity and specificity of the assays that are used to identify 
and characterize ligands. 

In this section, the various combinatorial library meth- 
ods will be discussed in terms of the assays that are used. 
The assay formats are closely matched to the mode of 
presentation of the diversity. In broad terms, assay 
procedures can be grouped into three categories: (i) those 
that rely on affinity purification with an immobilized target 
receptor, (ii) those in which a soluble receptor binds to 
tethered ligands, and (iii) those in which soluble com- 
pounds are tested for activity, either directly or in 
competition assays. Each format presents different chal- 
lenges with regard to the minimum affinity requirements 
for ligand detection, the demonstration of binding speci- 
ficity, and the ability to discriminate among compounds 
in the library on the basis of their affinities for the target. 

Isolation of Ligands by Affinity Purification 

Recombinant Peptide Libraries. The various systems 
described in the first part of this series 1 for creating vast 
libraries of recombinant peptides (commonly referred to 
as peptide/nucleic acid complexes below) rely on affinity 
purification to select peptides that bind to a receptor. Two 
distinct methods have been employed to achieve affinity 
purification of peptide/nucleic acid complexes. The first 
involves incubation of a receptor in solution with the 
mixture of complexes. After allowing sufficient time for 
binding, the receptor is captured using immobilized 
streptavidin or an antireceptor antibody. 43,44 The second 
approach calls for preimmobilization of the receptor on 
beads, microtiter wells, or a chromatography support, 
followed by capture of the complexes. 46 In both cases, the 
use of a solid-support facilitates the separation and washing 
of receptor-bound complexes. 

The method of receptor immobilization is a critical 
aspect of the successful use of recombinant peptide 
libraries. Because of the tremendous levels of ligand 
enrichment attainable through multiple rounds of selection 
and amplification, peptides that bind to any component 
of the solid support can be isolated from libraries. Peptides 
binding to streptavidin, 46,47 antireceptor antibodies, 48 or 
peptides that exhibit inherent nonspecific binding are 
readily isolated. Often, screening strategies employing 
subtractive methods and/or blocking ligands, are necessary 
to enhance the selection of ligands with desired binding 
specificity. 

To enhance the probability of isolating peptide ligands 
with biological function, it is important to ensure that the 
receptor is active (for example, capable of binding its 
natural ligand) when immobilized. Immobilization of 
receptor proteins on microtiter wells or beads can be 
accomplished by passive adherence, covalent attachment, 
biotinylation and immobilization on streptavidin, or 
capture with high-affinity nonblocking antibodies. The 
first three processes often result in inactive proteins. The 
problem of immobilizing active protein can often be 
overcome by introducing into the receptor an immobiliza- 
tion handle through genetic engineering techniques. 
Peptide epitopes for a monoclonal antibody or a motif 
that allows for site-specific biotinylation of the protein 40 
can be fused to proteins for this purpose. Generic 
immobilization strategies of this type greatly facilitate the 
creation of a high-density affinity matrix suitable for 
isolation of ligands. 
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Some drug-discovery targets may not be readily available 
as pure soluble receptors. Peptide ligands specific for the 
integrin Ilb/IIIa have been successfully isolated from phage 
libraries by screening against platelets expressing a high 
density of this receptor. 50 It remains to be seen whether 
whole cells bearing receptors or other forms of impure 
macromolecular target will generally be successfully 
utilized to identify ligands. It can be anticipated that the 
problem of isolating non-receptor-binding sequences will 
be enhanced when dealing with impure forms of receptors. 

The choice of using preimmobilized receptor rather than 
incubation with receptor in solution followed by receptor 
capture on a solid support may become important in one 
aspect of library screening. Because the phage and Lad/ 
DNA complex systems are capable of displaying multiple 
copies of a peptide, multivalent binding can occur during 
affinity purification if receptors are immobilized at 
sufficient density. Multivalent binding effectively in- 
creases the avidity of the bound complexes and allows the 
isolation of complexes bearing peptides of lower affinity. 
Although it is possible that multivalent interactions may 
occur, during the capture step of the two-step procedure, 
the stoichiometry of the phage or Lad complex, receptor, 
and the capture reagent need to be carefully controlled. 
Multivalent binding conditions may be more easily 
engineered with a high-density preimmobilized receptor. 

The multivalency of the phage and La<?I systems can be 
exploited to isolate peptides of modest affinities (K& values 
of 1-1000 mM). This feature is particularly important in 
screening random libraries of peptides greater than six or 
seven amino acids in length. Libraries that can be routinely 
made have many fewer members (lO^lO 10 ) than the 
theoretical number of possible sequences for a given 
peptide length (for example, there are 1 X 10 13 possible 
10-mers). In general, peptide ligands for a receptor target 
consist of families of related sequences with few high 
affinity ligands in the family. Therefore, when a library 
is created, it is likely that only lower affinity members of 
the family will be represented. The ability to identify 
these low-affinity ligands then permits one to proceed to 
the secondary phase of discovery, the screening of mu- 
tagenesis libraries. 

Methods for creating many variants of an initial 
sequence have been described in part 1. Such libraries 
will generally contain many low-affinity ligands and 
perhaps some high-affinity ligands in much lower abun- 
dance. A demand is therefore placed on the ability to 
selectively isolate the highest affinity ligands. Several 
methods have been described for efficient affinity selection. 
All are based on the prevention or disruption of multivalent 
interactions. The use of a low density of immobilized 
receptor to isolate high-affinity peptide ligands from a 
phage library containing many low-affinity ligands has 
been demonstrated. 44 Low receptor density reduces the 
possibility of multivalent interactions between phage 
particles and the receptor matrix. The "monovalent- 
phage" approach 45 ' 61,62 has been successfully employed to 
isolate high-affinity mutants of human growth hormone 
displayed on phage. In this approach, phage particles with 
only a single chimeric pill protein are created, thereby 
eliminating multivalent binding to immobilized receptor. 
Using this approach, mutants with K& values of less than 
5 pM have been identified. 

An alternative method of affinity selection that may 
have advantages over other approaches has been devel- 



oped. 48 ' 53 Phage (or Lad particles) are allowed to bind in 
a multivalent fashion to a high density of immobilized 
receptor. For phage-bearing low-affinity ligands, the 
peptide on an individual pill protein may be rapidly 
dissociating and reassociating, but the phage particle will 
not dissociate unless all the peptides on pill are simul- 
taneously in the unbound state. Dissociation of the phage 
can be initiated by addition of a competing ligand, which 
prevents rebinding of any individual peptide in the 
complex. Using a model system with peptides of known 
affinity, it was demonstrated that phage-bearing high- 
affinity peptides are retained for a greater length of time 
than phage with lower affinity sequences. 48 The concen- 
tration (and affinity) of the competing ligand, as well as 
. the time and temperature of elution, can be varied to select 
for ligands of various affinities. This method has the 
advantage of using a high receptor density to ensure a full 
sampling of ligands of all affinity classes. 

Achieving affinity selection is only part of the process 
of successfully screening recombinant peptide libraries. 
After selection, it is necessary to establish the binding 
specificity and, if possible, the affinity of individual 
peptides that result from the selection. Various assays 
have been described, including dot blots, 64 colony lifts, 66 
and ELISA's with immobilized phage or immobilized 
receptor. 44 These methods differ in the minimum ligand 
affinity that is required for detection. In general, assays 
in which phage or Lad are immobilized (ELISA's, dot 
blots, colony lifts) require higher affinity (K& values < 1 
fiM) peptides for detecting specific binding, and are 
therefore useful when such ligands are present in the 
selected pool. However, for reasons cited above, detection 
of the specific binding of lower affinity ligands is often 
necessary. In such cases, assays that use a high density 
of immobilized receptor are required to allow for multi- 
valent binding and to increase the sensitivity of detection. 
If high-density receptor matrices are used for affinity 
purification and assay of individual clones, peptides with 
Kd values as high as 100-500 fxM can be isolated with 
phage and Lad systems. 48 

An additional assay format has been described for 
estimating the affinity of peptides displayed by individu- 
ally selected phage clones. 44 Radiolabeled receptor is first 
allowed to bind to the phage-borne peptides in solution. 
A high concentration of competing peptide is then added 
to prevent further binding, and the dissociation of ra- 
diolabeled receptor is followed with time. With a mono- 
clonal antibody model system, a good correlation was 
observed between dissociation rates and the affinity of 
the peptide determined by solution methods. It has also 
been suggested that colony lifts with limited receptor 
concentration may allow discrimination of individual 
phage clones on the basis of their peptide's affinity. 66 
However, this method may be confounded by differential 
levels of expression of phage by different colonies. 

In summary, the successful identification of ligands from 
recombinant random peptide libraries depends not only 
on the nature and size of the libraries but also on effective 
screening strategies. Selection methods and assays of 
individual clones vary in their ability to select and detect 
lower affinity peptides and in the ease with which binding 
specificity can be determined. High-affinity ligands are 
most desirable, and initial conditions for screening of 
random libraries can be adjusted so that only high-affinity 
ligands are selected. However, for reasons stated above, 
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engineering selection and assay conditions to allow isola- 
tion and detection of lower affinity ligands may be generally 
a more reliable strategy. These initial peptides can serve 
as starting points for creating secondary recombinant 
peptide libraries or as leads for refinement by synthetic 
chemical combinatorial methods. 

Affinity Purification of Mixtures of Soluble Syn- 
thetic Compounds. Investigators have employed affinity 
purification methods to isolate ligands from mixtures of 
soluble peptides 26 ' 66,67 and oligonucleotides (RNA or 
DNA). 68-60 In the case of nucleic acid libraries, one takes 
advantage of the ability to enzymatically amplify the 
molecules resulting from affinity purification, and as with 
the recombinant peptide systems, multiple rounds of 
selection and amplification are used. Theoretical con- 
siderations in optimizing conditions for the selection of 
high-affinity oligonucleotides have been described. 61 The 
authors illustrate (by way of computer simulation) the 
importance of nucleic acid and receptor concentrations as 
well as the efficiency of separating specifically bound 
molecules. Computer simulations show that, under ideal 
conditions, rare high-affinity molecules can be isolated 
from large libraries with relatively few rounds of selection 
and amplification. There have been a number of examples 
of successful identification of high-affinity oligonucleotides 
using this process, 82 

Relatively little work has focused on the affinity 
purification of ligands from soluble peptide libraries. The 
isolation of ligands for an anti-gpl20 antibody from 
equimolar mixtures containing 19 or 32 peptides has been 
reported. 56,57 More recently, the same antibody was used 
to capture ligands from four mixtures, each of 50 peptides, 
comprised of unnatural amino acids fused to an encoding 
L-amino acid peptide strand. 26 After affinity purification, 
the resulting pool of peptides selected was resolved by 
HPLC and each peak subjected to Edman sequencing and 
mass spectrometry analysis. A major limitation of this 
approach is the sensitivity of these analytical methods. 
Sufficient peptide (> 1-10 pmol) must be recovered in order 
to determine its sequence, requiring that each library 
member be present in relatively high amounts in the 
starting pool and that there be sufficient receptor available 
to isolate the requisite quantity of each of the high-affinity 
ligands. In addition, the method requires that selected 
peptide ligands be resolved chromatographically. While 
it seems unlikely that this methodology will be extended 
beyond libraries of modest size (less than a few thousand 
members), it may prove a useful technique for evaluating 
secondary (analog) libraries. The proposed approach of 
creating a library of soluble compounds with attached 
oligonucleotides tags may allow for the structural iden- 
tification of minute quantities of compounds isolated by 
affinity purification. 63 * 64 

In theory, chromatography of compound mixtures using 
receptor columns should not only facilitate separation of 
nonbinding members of the library, but should also allow 
for the resolution of compounds on the basis of their 
receptor affinities. Work with various model systems has 
demonstrated that column retention time can be used as 
an index of affinity. 65,66 While columns of receptor target 
have been used in batch affinity purification methods, 
chromatography to resolve ligands of differing affinities 
has yet to be applied to screening combinatorial libraries. 
This method may be better suited to isolation of ligands 
of moderate affinity. 67 An additional limiting factor in 



the use of chromatography may be the large amount of 
receptor required to generate enough theoretical plates to 
effectively resolve compounds. 

Binding of Receptors to Immobilized Ligands 

Various methods for creating libraries of compounds 
attached to solid supports (pins, beads, chips, etc.) have 
been outlined in part l. 1 Such libraries are screened by 
detecting the direct binding of a labeled receptor to an 
immobilized ligand. The identify of the ligand is either 
determined directly (by peptide sequencing or mass 
spectrometry), specified by its spatial location in an array, 
or deduced by reading an encoding tag. 

There are a number of important issues related to solid- 
phase binding assays with immobilized ligands. First, the 
ability of a receptor to interact with a tethered ligand may 
be influenced by the site or nature of its covalent 
attachment to the solid support. In all of the methods 
published to date, peptide ligands are attached to a linker 
and support via the carboxy terminus of the sequence. An 
obvious example of the limitation imposed by this mode 
of immobilization would be in screening against the 
G-protein-linked receptors of various peptide hormones, 
many of which require a free C-terminal carboxamide for 
activity. In such a case, it is likely that many peptide 
analogs that would bind when free in solution would be 
missed in an assay where the same peptides were im- 
mobilized via their C-termini. To circumvent this problem, 
it is advantageous to have several alternative sites of ligand 
attachment to the surface. Methods for tethering peptides 
through their N- termini have been identified. 68 It is likely 
that the issue of how best to tether molecules to surfaces 
will become even more important when dealing with 
libraries of small nonpolymeric organic compounds. 

The chemical nature of the linkage between the ligand 
and support may also affect the receptor-ligand interaction. 
One needs only to look at the variety of resins that are 
available for affinity chromatography to appreciate the 
importance of controlling the receptor-ligand interface. 
The types of linker groups that have been successfully 
employed in tethered library assays to date have been 
noted in part 1 of the series. Whether these linkers will 
generally provide for optimal presentation of compounds 
to other receptor systems remains to be seen. 

Immobilized ligand assays require that the receptor be 
labeled in a way that allows for highly sensitive detection 
of receptor binding. The receptor can either be labeled 
directly or a secondary labeled reagent with high affinity 
for the receptor can be used. To date, colorimetric 
enzymes, radioisotopes, and fluorophores have been used 
in labeling receptors or secondary reagents. The reagents 
.must be labeled in a way that maintains the activity of the 
receptor, for instance, its ability to bind a natural ligand. 
This can be greatly facilitated by creating chimeric 
recombinant receptors that incorporate peptide epitopes 
of antibodies or peptide sequences for site-specific ra- 
dioactive phosphorylation 69 or site-specific biotinylation. 49 

Successful screening of libraries of immobilized synthetic 
ligands is dependent on the same types of issues as have 
been previously discussed with respect to evaluating 
recombinant peptide libraries: i.e., the affinity threshold 
for detection, the ability to discriminate ligands on the 
basis of their affinities, and the ability to distinguish 
specific binding from nonspecific binding. Methods 
development in this area is in its infancy. In principle, it 
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should be possible to exploit multivalent binding to detect 
lower affinity ligands. Multivalent receptors can be 
created by a number of methods, including genetic fusions 
to generate bivalent receptor/Fc fusions 70 or through the 
use of monoclonal antibodies or streptavidin to create 
cross-linked receptors capable of interaction with more 
than one immobilized ligand. Optimization of the density 
of immobilized ligands may be required in order to allow 
for multivalent binding. As has already been noted, it 
may be important to be able to isolate relatively low-affinity 
ligands in the initial screening of random libraries. These 
compounds can then serve as the basis for further library 
construction in which the goal is to improve ligand affinity. 
Affinity discrimination during the screening of either 
primary random libraries or secondary (analog) libraries 
is of obvious importance. There has been little published 
work on methodology in this area. In principle, low 
receptor concentrations, competing ligand-mediated dis- 
sociation, and/or stringent washing conditions can be 
utilized to identify the highest affinity ligands. Two issues 
complicate the use of such methods. The first is the 
likelihood that each pin, bead, or surface synthesis site 
does not contain the same amount of compound. With 
different compound loadings, one must be extremely 
cautious of using the absolute quantity of bound receptor 
as an index of a molecule's affinity. As new building- 
block and coupling chemistries are adapted to combina- 
torial formats, this may become a more significant problem 
than it is for high-yielding peptide chemistry. Another 
complicating feature of the immobilized ligand assay 
format is the fact that ligands of one particular kind are 
densely clustered on a surface. Both the association and 
dissociation rate constants of a receptor/ligand interaction 
are affected by surface ligand density. The binding of 
nearby ligands depletes the local receptor concentration 
and the association kinetics become diffusion limited. After 
dissociation, receptor rebinding is favored because of the 
high local-ligand concentration and the apparent dis- 
sociation rate is reduced. Theoretical and experimental 
analyses of these surface binding effects have been 
undertaken. 71 The impact that these surface binding 
kinetics will have on the ability to discriminate among 
library members on pins, beads, or glass surfaces remains 
to be seen. 

The information generated by screening immobilized 
ligand libraries differs among the various library formats. 
In the case of bead-based technologies, compounds 
exceeding a threshold affinity are sampled from a large 
pool of ligands. Positive information is obtained, i.e., that 
a particular ligand binds to the receptor. One cannot, 
however, draw conclusions about the binding affinity of 
nonselected ligands. The sampling size may not have been 
large enough to include all high-affinity ligands, or a high- 
affinity bead may have been missed by the affinity selection 
method [for example, fluorescence-activated cell sorter 
(FACS) selection] . By contrast, the multipin and VLSIPS 
technologies allow one to perform a parallel assay in which 
data is obtained on every compound that is synthesized. 
In principle, both positive and negative binding informa- 
tion can be exploited in the design of second-generation 
compounds. 

Incorporation of methods that assess the specificity of 
binding of ligands is an important aspect of screening 
random libraries. Screening immobilized ligands by direct 
receptor binding can lead not only to the identification of 



ligands of interest (for instance, ligands that compete with 
the natural ligand) but also to ligands that bind to 
undesired portions of the receptor or to secondary detec- 
tion reagents. In the case of libraries of compounds on 
beads, it may be possible to remove undesired ligands in 
a subtraction step prior to screening for desired ligands. 
For compounds on pins or chips, it may be possible to 
make replicate arrays and test for total binding and 
nonspecific binding in parallel. Otherwise, sequential 
assays that first test for receptor binding of any kind, 
followed by an assessment of nonspecific binding will be 
required in order to correctly identify compounds that 
interact with the receptor in a desired manner. 

Testing the Activity of Libraries of Soluble 
Compounds 

The classical method of screening for a desired biological 
activity is to test soluble compounds one at a time in a 
competition binding assay, enzyme inhibition assay, or in 
a cell-based bioassay. Such approaches have been applied 
to library screening by releasing compounds synthesized 
on pins into microtiter wells, as described in section C of 
part 1 of this series. 1 A novel application of bead 
technology has recently been disclosed where compounds 
on individual beads are released locally onto a lawn of 
confluent mammalian cells and cause activation of cells 
in the area surrounding the bead. 72 The bead responsible 
for cell activation is isolated and a small amount of 
noncleaved peptide is sequenced to determine its structure. 
In both of these cases, the principal issue is whether enough 
compound is released to be detected by the assay. For 
pins, approximately 100 nmol of peptide can be released 
into a few hundred microliters of solution, while beads 
with diameters of ~ 100 can release on the order of 100 
pmol of peptide. 

Rather than assaying compounds individually, a second 
approach to screening soluble libraries is to assay com- 
pound mixtures. In addition to testing complex pools of 
soluble peptides (vide supra) , libraries of oligonucleotides 
have been successfully screened as soluble mixtures. 73 The 
most frequently used strategy for screening mixtures of 
soluble compounds with the goal of ultimately identifying 
single active molecules is based on the "mimotope" 
approach, detailed in part 1. The essence of this strategy 
is that degenerate pools of peptides (or other compounds) 
are resolved into their most active constituents by an 
iterative process of testing and resynthesis until a single 
sequence is identified as having high activity. A variation 
of the methodology (termed "bogus-coin strategy") has 
also been described. 74 

There are a number of caveats to using this methodology 
for testing soluble compound mixtures. In practice, the 
results of each set of assays do not typically indicate a 
preference for a unique residue at any position within the 
sequence, Rather, comparable assay results may be 
obtained for several different amino acid substitutions 
and some decision must be made as to which of these 
partial solutions should be fully resolved. The number of 
peptide mixtures to be synthesized and tested in this 
protocol expands dramatically as the number of alternative 
sequences selected for complete resolution at each cycle 
is increased. Moreover, the deconvolution of different 
partial solutions may frequently produce divergent re- 
solved sequences, in part because the contribution of each 
amino acid to the peptide-receptor interaction is typically 
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dependent on other non~neighboring residues within the 
ligand. The problem of identifying the most potent ligand 
in a complex mixture by this type of iterative pathway is 
exacerbated by the relative abundance of lower affinity 
ligands that represent local binding optima. 

Originally designed for identifying antibody ligands, the 
mimotope strategy has primarily been used for libraries 
of six to eight building blocks in length. It is not clear that 
ligands of this s&e will be optimal for other types of 
receptors (although success with opioid receptors 75 and 
other targets have been reported). As the length of the 
compounds in the library increases, resyn thesis and testing 
of pools becomes more cumbersome. 

Perhaps the greatest limitation of this methodology is 
the fact that the activity of a given pool is based on the 
cumulative activity of all the compounds in the pool; i.e., 
pools with the same activity may contain many low-affinity 
compounds or a few high-affinity compounds. For this 
reason, the methodology is greatly facilitated if the minimal 
fragment having activity is comprised of the same number 
of building blocks as used in constructing each library 
member (e.g., a uniquely active tetramer is more easily- 
resolved from a tetrapeptide library than a hexamer 
library). Alternatively, the identification of active peptide- 
(s) is facilitated if the receptor has specific requirements 
for a fixed position within a peptide ligand (e.g., the N or 
C termini). If neither of these conditions is true, it may 
be necessary to test many or all of the possible initial pools 
with two adjacent or nonadjacent fixed residues. This 
drastically increases the number of initial pools that need 
to be synthesized but increases the probability that a 
critical residue(s) is fixed in at least one pool to allow that 
pool to differentiate itself. It must be kept in mind that 
any pool identified as having the greatest activity may be 
composed of many moderately active compounds and that 
the most active compound (s) may reside in other pools. 

Testing of mixtures of soluble compounds is also limited 
by the concentration of individual test compounds that 
can be achieved in the initial pools. Pools containing as 
many as 160 000 different peptides have been tested with 
each member being present at —10 nanomolar concentra- 
tion. 76 Because of limitations on the solubility of the total 
pool, the concentrations of individual compounds present 
in increasingly larger libraries must be correspondingly 
diminished. This will ultimately limit the ability to 
identify the activity of compounds with modest potencies. 

While the current methods for testing mixtures of soluble 
compounds have certain drawbacks* screening soluble 
libraries does have the decided advantage of avoiding the 
problems associated with assaying tethered molecules in 
other combinatorial technologies. Conventional binding 
and enzyme and cell-based assays (including those with 
poorly defined biochemical targets) can be used to test 
the activity of soluble compounds. It is likely that in the 
future, encoding strategies will be employed to allow more 
facile screening of soluble molecules. In the simplest 
format, single encoded beads can be dispensed into 
microtiter wells. The compounds can then be released 
from the beads and tested for activity, with the identity 
of the most active compound(s) being deduced by decoding 
the tag attached to the bead(s). To test large libraries of 
soluble compounds, mixed pools of encoded beads can be 
created. At each round of testing, only a fraction of the 
compound is cleaved from each bead. Active pools of beads 
are pursued by further subdividing the beads, partially 
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releasing the compounds, and testing for activity. In the 
end, a single bead is identified with the greatest activity 
and the encoding tag is then read. 77 

C. Integration of Combinatorial Technologies for 
Drug Discovery and Future Directions of the 
Field 

Regardless of whether the objective is a broad discovery 
search or analoging a known lead, a key aspect in the 
successful application of combinatorial technologies to 
drug discovery is the requirement for having a closely 
linked, coordinated process for the integration of synthesis 
and screening. The creation and evaluation of molecular 
diversity are two sides of the same coin. It is still premature 
to speculate on which type of format will prove most 
suitable for a particular type of discovery/development 
problem. Over time, a cataloging of successes and failures 
will clarify this issue. In all likelihood, command of a 
collection of combinatorial tools will be required forgeneral 
success. One may imagine a "spectrum of molecular 
diversity" stretching from few to many molecules (see 
Figure 19). Serial synthesis (contemporary medicinal 
chemistry) operates on a few molecules (far right of chart). 
We expect that each individual combinatorial tool/format 
will turn out to be most valuable within proscribed regions 
of numbers of molecules. Hence, recombinant peptide 
diversity is particularly suitable for generating and 
screening large libraries of >10 8 compounds. The VLSIPS 
chip technology, although capable of displaying vast arrays, 
is primarily an analoging tool and is most useful for 
evaluating lOMO 4 compounds. The parallel organic 
synthesis methods of Hobbs DeWitt et alJ® are applicable 
for tens of compounds. Encoded synthetic libraries appear 
to be relevant in broad diversity searching and may also 
prove useful in narrower optimization strategies. Given 
the repetitive nature of many of the manipulations required 
for library construction, an on-going priority will be to 
address the possibility of automating as many aspects of 
the generation/evaluation process as possible. Growth in 
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library size also impacts directly on the physical size of 
compound collections and on the amounts of target 
receptor required for screening. These pressures will 
continue to drive the field toward minaturization and 
exploitation of nanotechnologies. 

The power of combinatorial technologies in generating 
huge numbers of compounds suggests that in a lead- 
discovery mode, less preconceived bias need be brought 
to the process of making molecules. Another way of 
expressing this is as follows: let the numbers do the talking. 
Due to the time and effort required in serial approaches, 
each target molecule is selected with great care. Because 
of the relative ease in creating libraries, little risk is 
incurred, or effort expended, in allowing a wide variety of 
building blocks to participate in diversity generation. Since 
there is less up-front investment in any individual com- 
binatorially created molecule, the combinatorial chemist 
can afford to take more risks. We can think in terms of 
a portfolio of libraries which might be routinely applied 
to the initiation of a drug-discovery search. This is not 
dissimilar to selecting a preliminary screening sample of 
diversity from a large database of individual molecules. 

A related, but still immature, issue in combinatorial 
approaches to drug discovery revolves around the idea of 
"quantitation of diversity". An understanding of the 
concept of "measuring" molecular diversity could impact 
on designing libraries to contain maximal structural 
diversity. This notion has arisen previously in deciding 
which few representative, highly diverse compounds to 
select out of large database collections, when setting up 
groups of preliminary screening samples. The huge 
numbers involved in combinatorial approaches intensifies 
this issue. A number of interesting approaches to the 
diversity quantitation problem can be expected to emerge. 

One working drug-discovery paradigm might be based 
on initially employing a portfolio of biological diversity 
(peptide libraries) together with standard chemical librar- 
ies (various-sized cyclic peptides, cysteinyl- linked cyclics, 
etc.), peptides with carboxyl or carboxamide display, 
synthetic polymeric diversity, as well as large libraries of 
semirigid and acyclic small molecules prepared by COS. 
Over time, favored libraries and directions would be 
expected to emerge. As the sophistication of combinatorial 
organic synthesis grows, the origins of a molecular structure 
as either "combinatorially or serially derived" will gradually 
become transparent. 

Another area where considerable effort must be applied 
is in the registry of libraries and individual library 
members. It is unclear that library compounds should be 
registered and documented for testing in the same ways 
as serially produced compounds historically have been, 
but exactly what changes are necessary remain to be 
determined. Vast numbers of compounds have been and 
are being created; keeping track of these and their 
corresponding biological activities will require innovative 
. database-management techniques. Additionally, nomen- 
clature needs to be developed by which one can simply 
express the constitution, scope, and nature of chemical 
libraries. Legal issues, including the patenting and 
documentating of libraries and their component members, 
will need to be pioneered. 

As repeatedly emphasized, it is obvious but imperative 
to have efficient means of evaluating the molecular 
diversity which is generated. Different assay techniques 
will be format specific. Assays must clearly discriminate 



specific from nonspecific binding. Since in a broad 
screening mode, one is almost always sampling a small 
percentage of the entire universe of diversity (10 10 peptides 
are only 0.1 % of the universe of 10-mers), it is crucial that 
appropriate assay techniques be competent to detect 
modest affinity ligands. The identification of weak binders 
in any of the aforementioned approaches is very important 
and should lead directly to preparation of secondary 
libraries in which original "hite" will become the center- 
piece for more focused diversity creation. This is a 
consequential issue, since application of combinatorial 
technologies are best viewed as an iterative process and 
not a singular event. As the emphasis shifts to analog 
evaluation, assays must be capable of affinity discrimina- 
tion between closely related library members. The tools 
of molecular biology have permitted the molecular engi- 
neering of targets to serve the purposes of screening. The 
rapid introduction of targets into a screening mode will 
require generic techniques for their handling, and ma- 
nipulation of molecular targets by genetic engineering will 
continue to play a crucial role in marrying library 
evaluation and synthesis. Though combinatorial tech- 
nologies may soon prove their worth in the drug-discovery 
process by delivering new leads quickly and cheaply, in 
order to completely fulfill the promise of "making drugs", 
an important question will be whether some of the common 
major obstacles to drug development (e.g., cell penetration, 
bioavailability, pharmacokinetics, metabolism) can be 
productively addressed through the application of com- 
binatorial approaches (i.e., combinatorial drug develop- 
ment). 

In the coming years, cloning and sequencing of the 
human genome promises that an unprecedented abun- 
dance of newly discovered proteins will become available 
as potential drug targets. Gaining even more prominence 
than it now assumes will be the issue of discriminating 
among a myriad of receptors and enzymes to identify valid 
targets for drug discovery. The ability to access potent 
and specific ligands for these targets will guide this process 
by untangling the physiological relevance of endogenous 
biochemical pathways. Combinatorial methods will be 
called upon to provide such molecules to quickly and 
cheaply drive target validation. In this manner, the 
identification of leads will benefit from a significant, but 
hidden, benefit which emerges from combinatorial screen- 
ing; hits derived from chemical libraries should be readily 
amenable to combinatorial analoging. 

Certain drug targets may present more or less of a 
historical precedent with respect to the likelihood of 
successfully identifying a tight binding ligand through the 
use of known pharmacophores. For example, the search 
for specific enzyme inhibitors may be facilitated by the 
intentional enrichment in the combinatorial synthetic 
process of building blocks containing known inhibitory 
pharmacophores. Particularly important or common types 
of drug targets may justify having on hand special libraries 
which are somewhat specific (i.e., a peptidyl hydroxy- 
ethylamine library for aspartyl- and metalloprotease 
inhibition 7 *-* 0 ). On the other hand, in areas where there 
is less current information (e.g., antagonism of protein- 
protein or carbohydrate-protein interactions), a wider 
scope of diversity search should be taken until consistent 
patterns begin to emerge. In the case of newer, less 
explored target groups, combinatorial technologies can 
be expected to assist in unearthing new pharmacophore 
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classes and to help establish an understanding of drug 
design for new types of targets. 

Combinatorial technologies diverge sharply from his- 
torical precedent through a change in emphasis from the 
consideration of individual molecules to thinking in terms 
of populations of molecules. A common, but false, intuitive 
believe is that combinatorial chemistry is necessarily a 
random, screening search; the antithesis of rational drug 
design. In fact, all libraries are biased in some ways. All 
drug company compound files are biased by the historical 
programs of that institution, since a disproportionate share 
of compounds of particular types will have been deposited. 
The notion of intentionally biasing a chemical library is 
a form of drug design, but again not applied to individuals 
but rather to groups or populations of molecules. If a 
scientist hypothesizes on the basis of structural information 
that the current lead molecule contains a type II 0-turn 
motif, then rather than performing two or three serial 
tests of this idea, the combinatorial chemist might create 
a library of narrow diversity utilizing a basis set of 0-turn 
mimetics and thus interrogate many slightly different 
regions of conformational space simultaneously. The drug 
design of populations versus individuals is analogous to 
fishing with a net rather than just a hook. As more 
knowledge of workable strategies for combinatorial syn- 
thesis are understood, it is expected that structural and 
computational input and other rational design information 
will be integrated into a broad combinatorial medicinal 
chemistry approach. 

Gaining a full appreciation of the issues and difficulties 
which must be surmounted in order to perform useful 
combinatorial organic synthesis will initially be a relatively 
slow process, especially because the important strategies 
and decision points differ so markedly from traditional 
organic synthesis. Retrocombinatorial analysis of existing 
pharmacophores and other important structures should 
assist in decision making; both in choosing routes of 
forward synthesis and in synthetic target selection. If 
combinatorial techniques are indeed to become a useful 
shortcut to new leads and optimized compounds, then 
one key implied goal of combinatorial organic synthesis 
is to intersect the pathway of modern medicinal chemistry 
upon which compounds move from the early discovery 
stages to clinical candidacy. Rich incentives await those 
who are able to mass produce important biologically active 
molecules quickly and cheaply. Not surprisingly, an 
aggressive, worldwide effort to understand and master this 
field has already begun. 

This Perspective has been restricted to a consideration 
of the impact of combinatorial technologies on medicinal 
chemistry/drug discovery and development. From the 
point of view of applicability of the technologies, this is 
an artificially narrow view. Combinatorial processes will 
become important in diagnostic medicine, 81 agricultural 
chemistry, food chemistry, immunology, molecular biology, 
polymer studies, inorganic synthesis, and many other 
fields. Though the field of "combinatorial chemistry" is 
chronologically a new enterprise, the evolution of thought 
in this fertile area continues to outrace the experimental 
reduction to practice of many ideas. One may reasonably 
ask **why are combinatorial technologies happening now?**. 
The answer is probably complex and beyond the scope of 
this Perspective. Nevertheless, the explosive recent 
interest in the application of combinatorial technologies 
to drug discovery is symptomatic of an idea whose time 
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has come. Because the issues which confront the medicinal 
chemist differ so radically from historical approaches, the 
combinatorial field will no doubt continue to provide 
impetus and stimulation for the formulation of new 
concepts and ideas. 

Acknowledgment. We thank Drs. A. Zaffaroni and 
G. Ringold for their interest, insights, and support in 
undertaking the application of combinatorial technologies 
to drug discovery and Mrs. Terri Nebozuk for help in 
preparing this manuscript. 



References 

(1) Part 1: Gallop, M. A.; Barrett, R. W.; Dower, W. J.; Fodor, S. P. 
A.; Gordon, E. M. Applications of Combinatorial Technologies to 
Drug Discovery. 1. Background and Peptide Combinatorial 
Libraries. J. Med. Chem. 1994, 37, 1233-1261. 

(2) Barker, P. L.; Bullous, S.; Bunting, S.; Burdick, D. J.; Chan, K. S.; 
Deisher, T.; Eigenbrot, C; Gadek, T. R.; Lipari, M. T.; et al Cyclic 
RGD Peptide Analogues as Antiplatlet Antithrombotics. J. Med. 
Chem. 1992, 35, 2040-2048. 

(3) Al-Obeidi, F.; De L Castrucci, A. M.; Hadley, M E.; Hruby, V. J. 
Potent and Prolonged Acting Cyclic Lactam Analogues of an a-Mel- 
anotropin: Design Baaed on Molecular Dynamics. J. Med. Chem. 
1989, 32, 2665-2561. 

(4) Wiley, R. A.; Rich, D. H. Peptidomimetics Derived from Natural 
Products. Med. Res. Rev. 1993, 23, 327-384. 

(5) Giannis, A.; Kolter, T. Peptidomimetics for Receptor Ligands— 
Discovery, Development, and Medicinal Perspectives. Angew. 
Chem., Int. Ed. Engl. 1993, 32, 1244-1267. 

(6) Kates, S. A.; Sole, N. A.; Johnson, C. R.; Hudson, D.; Barany, G.; 
Albericio, F. A Novel, Convenient, Three-Dimensional Orthogonal 
Strategy for Solid Phase Synthesis of Cyclic Peptides. Tetrahedron 
Lett. 1993, 34, 1549-1552. 

(7) Polinsky, A.; Cooney, M. G.; Toy-Palmer, A.; Osapay, G.; Goodman, 
M. Synthesis and Conformational Properties of the Lanthionine 
Bridged Opioid Peptide [D-Ala L 3 ,AlaL 6 ]Enkephalin as determined 
by NMR and Computer Simulations. J. Med, Chem. 1992, 35, 
4185-4194. 

(8) Albericio, F.; Hammer, R. P.; Garcia-Echeverria, C; Molins, M. A.; 
Chang, J. L.; Munson, M. C; Pons, M.; Giralt, E.; Barany, G. 
Cyclization of Disulfide- Containing Peptides in Solid Phase 
Synthesis. Int. J. Pept. Protein Res. 1991, 37, 402-413. 

(9) Marlowe, C. K. Peptide Cyclization on TFA Lable Ream Using the 
Trimethylsilyl (TMSE) Ester as an Orthogonal Protecting Group. 
Bioorg. Med. Chem. Lett. 1993, 3, 437-440. 

(10) Trzeciak, A.; Bannwarth, W. Synthesis of 'Head to Tail* Cyclized 
Peptides on Solid Support by FMOC Chemistry. Tetrahedron 
Lett. 1992, 33, 4557-4560. 

(11) Andrews, P. C; Boyd, J.; Ogorzalek Loo, R.; Zhao, R.; Zhu, C.-Q.; 
Grant, K.; Williams, S. Synthesis of Uniform Peptide Libraries 
and Methods for Physico-Chemical Analysis. Poster presented at 
the Meeting of the Protein Society, San Diego, July 1993. 

(12) Metzger, J. W.; Wiesmuller, K.-H.; Gnau, V.; Brunjes, J.; Jung, G.; 
Ion-Spray Mass Spectrometry and High Performance Liquid 
Chromatography-Mass Spectrometry of Synthetic Peptide Librar- 
ies. Angew. Chem., Int. Ed. Engl 1993, 32, 894-896. 

(13) Zuckermann, R. N.; Kerr, J. M.; Siani, M. A.; Banville, S. C. Design, 
Construction and Application of a Fully Automated Equimolar 
Peptide Mixture Synthesizer. Int. J. Pept. Protein Res. 1992, 40, 
497—506 

(14) Fodor, S. P. A.; Read, J. L.; Pirrung, M. C; Stryer, L.; Lu, A. T.; 
Solas, D. Light-Directed, Spatially Addressable Parallel Chemical 
Synthesis. Science 1991, 251, 767-773. 

(15) Zuckermann, R. N.; Kerr, J. M,; Siani, M. A.; Banville, S. C; Santi, 
D. V. Identification of Highest-Affinity Ligands by Affinity 
Selection from Equimolar Peptide Mixtures Generated by Robotic 
Synthesis, Proc. Natl. Acad. Sci. UJSA. 1992, 89, 4505-4609; 

(16) Freidinger, R. M.; Perlow, D. S.; Veber, D. F. Protected Lactam- 
Bridged Dipeptides for Use as Conformational Constraints in 
Peptides. J. Org. Chem. 1982, 47, 104-109. 

■ (17) Crowley, J. I.; Rapoport.H. Solid-Phase Organic Synthesis: Novelty 
or Fundamental Concept Acc. Chem. Res. 1976, 9, 136-144. 

(18) Leznoff, C. C. The Use of Insoluble Polymer Supports in General 
Organic Synthesis. Acc. Chem. Res. 1978, 11, 327. 

(19) Frechet, J. M. J. Synthesis and Applications of Organic Polymers 
as Supports and Protecting GroupB. Tetrahedron 1981, 37, 683- 
683. 

(20) Hodge, P. Organic Reactions Using Polymer-Supported Catalysts, 
Reagents or Substrates. In Syntheses and Separations Using 
Functional Polymers; Sherrington, D. C, Hodge, P., Eds.; John 
Wiley and Sons: New York, 1988, pp 43-122. 

(21) Cho, C. Y.; Moran, E. J.; Cherry, S. R.; Stephana, J. C; Fodor, S. 
P. A.; Adams, C. L.; Sundaram, A.; JacobB, J. W.; Schultz, P. G. An 
Unnatural Biopolymer. Science 1993, 261, 1303-5. 



1400 Journal of Medicinal Chemistry, 1994, Vol 37, No, 10 

(22) Simon. R. J.; Kania, R, S.; Zuckermann, R. N.; Huebner, V. D.; 
Jewell, D. A.; BanviUe, S,; Ng, S.; Wang, L.; Rosenberg, S.; Marlowe, 

C. K.; Spellmeyer, D. C.; Tan, R.; Frankel, A. D.; Santi, D. V.; 
Cohen, F. E.; Bartlett, P. A. Peptoids: A Modular Approach to 
Drug Discovery. Proc. Natl Acad. ScL U.S.A. 1992, 89, 9367- 
9371. 

(23) Bartlett, P. A.; Santi, D. V.; Simon, R. J. Int. Patent W091/19735, 
1991. 

(24) Zuckermann, R. N.; Ken, J. M.; Kent* S. B. H.; Moos, W. H. Efficient 
method for the preparation of peptoidstoligo(N-substituted gly- 
cines)] by submonomer solid-phase synthesis. J. Am, Chem. Soc. 
1992, 114, 10646-10647. 

(25) Zuckermann, R N. Presented at the Third International Sympo- 
sium on Solid Phase Synthesis and Complementary Technologies, 
Oxford, U.K., 31 August-4 September, 1993. 

(26) Kerr, J. M.; BanviUe, S. C; Zuckermann, R. N. Encoded Combi- 
natorial Peptide Libraries Containing Non-Natural Amino Acids. 
J. Am. Chem. Soc. 1993, 115, 252&-2531. 

(27) Rich, D. H. Peptidase Inhibitors. In Comprehensive Medicinal 
Chemistry, Hansen, C, Sammes, P. G., Taylor, J. B., Eds.; Pergamon 
Press: Oxford, 1990, Vol. 2. 

(28) Bartlett, P. A.; Marlowe, C. K. Possible Role for Water Dissociation 
in the Slow Binding of Phosphorous-Containing Transition-State- 
Analogue Inhibitors of Thermolysin. Biochemistry 1987,26,8553- 
8561. 

(29) Morgan, B. P.; Scholtz, J. M.; Bellinger, M.D.; Zipkin, L D.; Bartlett, 
P. A. Differential Binding Energy: A Detailed Evaluation of the 
Influence of Hydrogen-Bonding and Hydrophobic Groups on the 
Inhibition of Thermolysin by Phosphorous-Containing Inhibitors. 
J. Am. Chem. Soc. 1991, 113, 297-307. 

(30) Campbell, D. A.; Bermak, J. C. Phosphonate Ester Synthesis Using 
a Modified Mitsunobu Condensation. J. Org. Chem. 1994, 69, 658- 
660. Campbell, D. A. The Synthesis of Phosphonate Esters, an 
Extension of the Mitsunobu Reaction. J. Org. Chem. 1992, 57, 
6331-6336. 

(31) Hagihara, M.; Anthony, N. J.; Stout, T. J.; Clardy, J.; Schreiber, 
S. L. Vinylogous Polypeptides: An Alternative Peptide Backbone. 
J. Am. Chem. Soc. 1992, 114, 656&-6570. 

(32) Smith, A. B., III; Keenan, T. P.; Holcomb, R. C; Sprengler, P. A.; 
Guzman, M. C; Wood, J. L.; Carroll, P. J.; Hirschmann, R. Design, 
Synthesis and Crystal Structure of a Pyrrolinone-Based Peptido- 
mimetic Possessing the Conformation of a 0-Strand: Potential 
Applications to the Design of Novel Inhibitors of Proteolytic 
Enzymes. J. Am. Chem. Soc. 1992, 114, 10672-10674. 

(33) Bunin, B. A.; Ellman, J. A. A General and Expedient Method for 
the Solid-Phase Synthesis of 1,4-Benzodiazepine Derivatives. J. 
Am. Chem. Soc. 1992, 114, 10997-10998. 

(34) Ellman, J. A. Pharmaceutical Manufacturers Association, Drug 
Discovery Management Subsection, September 19-21, 1993, Phila- 
delphia, PA. 

(35) Nikolaiev, V.; Stierandova, A.; Krchnak, V.; Seligmann, B.; Lam, 
K. S.; Salmon, S. E.; Lebl, M. Peptide-Encoding for Structure 
Determination of Nonsequenceable Polymers Within Libraries 
Synthesized and Tested on Solid-Phase Supports. Pept. Res. 1993, 
6, 161-170. 

(36) Hobbs DeWitt, S.; Kiely, J. S.; Stankovic, C. J.; Schroeder, M. C; 
Reynolds Cody, D. M.; Pavia, M. R. "Diversomers": An Approach 
to Nonpeptide, Nonoligomeric Chemical Diversity. Proc. Natl. 
Acad. Sci. U.S.A. 1993, 90, 6909-6913. 

(37) Mapelli, C; Swerdloff, M. D. Monitoring of Conformational and 
Reaction Events in Resin-Bound Peptides by Carbon-13 NMR 
Spectroscopy in Various Solvents. In Peptides. Proceedings of 
the 21st European Peptide Symposium; Giralt, E. and Andreu, 

D. , Eds., ESCOM, Leiden, The Netherlands, 1990; pp 316-319. 

(38) Giralt, E.; Rizo, J.; Pedroso, E. Application of Gel-Phase ia C-NMR 
to Monitor Solid Phase Peptide Synthesis. Tetrahedron 1984, 40, 
4141-4152. 

(39) Nicolaou, K. C; Salvino, J. M.; Raynor, K.; Pietranico, S.; ReiBine, 
T.; Freidinger, R. M.; HirBchmann, R. Design and Synthesis of a 
Peptidomimetic Employing 0-D-Glucose for Scaffolding. Peptides 
1939, 881-884. 

(40) Hirschmann, R.; Nicolaou, K. C; Pietranico, S.; Salvino, J.; Leahy, 

E. M.; Sprengeler, P. A.; Furst, G.; Smith, A. B.; Ill; Strader, C. D.; 
Cascieri, M. A,; Candelore, M. R. ; Donaldson, C. ; Vale, W.; Maechler, 
L. Nonpeptidal Peptidomimetics with a 0-D-Glucose Scaffolding. 
A Partial Somatostatin Agonist Bearing a Close Structural 
Relationship to a Potent, Selective Substance P Antagonist. J. 
Am. Chem. Soc. 1992, 114, 9217-9218. 

(41) Hirschmann, R.; Sprengeler, P. A.; Kawasaki, T.; Leahy, J. W.; 
Shakespeare, W. C; Smith, A. B., Ill, The First Design and Synthesis 
of a Steroidal Peptidomimetic. The Potential Value of Peptido- 
mimetics in Elucidating the Bioactive Conformation of Peptide 
Ligands. J. Am. Chem. Soc. 1992, 114, 9699-9701. 

(42) Hirschmann, R,; Sprengeler, P. A.; Kawasaki, T.; Leahy, J. W.; 
Shakespeare, W. C; Smith, A. B., Ill, The Versatile Steroid 
Nucleus: Design and Synthesis of a Peptidomimetic Employing 
this Novel Scaffold. Tetrahedron 1993, 49, 3665-3676. 

(43) Pannley, S. F.; Smith, G. P. Antibody- Selectable Filamentous Fd 
Phage Vectors: Affinity Purification of Target Genes. Gene 1988, 
73, 305-318. 



Perspective 

(44) Barrett, R. W.; Cwirla, S. E.; Ackerman, M. S.; Olson, A. M.; Peters, 
E. A.; Dower, W. J. Selective Enrichment and Characterization of 
High Affinity Ligands from Collections of Random Peptides on 
Filamentous Phage. Anal. Biochem. 1992, 204, 357-364. 

(45) Bass, S.; Green, R.; Wells, J. A. Hormone Phage: An Enrichment 
MethodforVejriantProteinswithAlteredBm^ Pro- 
teins: Struct. Funct. Genet. 1990, 8, 309-314. 

(46) Devlin, J. J.; Panganiban, L. C.; Devlin, P. E. Random Peptide 
Libraries: A Source of Specific Protein Binding Molecules. Science 
1990 249 404-406. 

(47) Kay.'B. K.; Adey, N. B.; He, Y.-&; Manfred!, J. P.; Mataragnon, 

A, H.; Fowlkes, D. M. An M13 Phage Library Displaying Random 
38-Amino-Acid Peptides as a Source of Novel Sequences with 
Affinity to Selected Targets. Gene 1993, 128, 59-65. 

(48) Barrett, R. W.; Dower, W. J. Unpublished. 

(49) Schatz, P. J. Use of peptide libraries to map the substrate specificity 
of a peptide-modifying enzyme: A 13 residue consensus peptide 
specifies biotinylation inEscherichia coli. Bio/Tech. 1993, 11, 1138- 
1143. 

(50) Doyle, M. Chiron Corporation. Presented at San Francisco Bay 
Area Peptide Society Meeting, December 1992. 

(51) Lowman, H. B.; Bass, S. H.; Simpson, N.; Wells, J. A. Selecting 
high-affinity binding proteins by monovalent phage display. 
Biochemistry 1991, 30, 10832-10838. 

(52) Garrard, L. J.; Yang, M.; O'Connell, M. P.; Kelley, R. F.; Henner, 
D. J. Fab Assembly and Enrichment in a Monovalent Phage Display 
System. Bio/ Technology 1991, 9, 1373-1377. 

(53) Hawkins, R. E.; Russell, S. J.; Winter, G. Selection of Phage 
Antibodies by Binding Affinity: Mimicking Affinity Maturation. 
J. Mol. Biol 1992, 226, 889-896. 

(54) Kang, A. S.; Barbas, C, F.; Janda, K. K; Benkovic, S. J.; Lerner, 
R. A. Linkage of Recognition and Replication Functions by 
Assembling Combinatorial Antibody Fab Libraries Along Phage 
Surfaces. Proc. Natl. Acad. Sci. U.SA. 1991, 88, 4363-4366. 

(55) Christian, R. B.; Zuckermann, R. N.; Kerr, J. M.; Wang, L.; Malcolm, 

B. A, Simplified Methods for Construction, Assessment and Rapid 
Screening of Peptide Libraries in Bacteriophage. J. Mol Biol 
1992,227,711-716. 

(56) Zuckermann, R. N.; Kerr, J. M.; Siani, M. A.; BanviUe, S. C; Santi, 
D. V. Identification of Highest-Affinity Ligands by Affinity 
Selection from Equimolax Peptide Mixtures Generated by Robotic 
Synthesis. Proc. Natl. Acad. ScL U.SA. 1992, 89, 4505-4509. 

(57) Kerr, J. M.; BanviUe, S. C; Zuckermann, R. N. Identification of 
Antibody Mimotopes Containing Non-Natural Amino Acids by 
Recombinant and Synthetic Peptide Library Affinity Selection 
Methods. Bioorg. Med. Chem. Lett 1993, 3, 463-468. 

(58) Tuerk, C; Gold, L. Systematic Evolution of Ligands by Exponential 
Enrichment: RNA Ligands to Bacteriophage T4DN A Polymerase. 
Science 1990, 249 f 505-510. 

(59) Ellington, A. £>.; Szostak, J. W. In vitro Selection of RNA molecules 
That Bind Specific Ligands. Nature 1990, 346, 818-822. 

(60) Sherman, M. L; Bertelsen, A. H.; Cook, A. F. Protein Epitope Tar- 
geting: Oligonucleotide Diversity and Drug Discovery. Bioorg. 
Med. Chem. Lett. 1993, 3, 469-476. 

(61) Irvine, D.; Tuerk, C; Gold, L. Selexion: Systematic Evolution of 
Ligands by Exponential Enrichment with Integrated Optimization 
by Non-Linear Analysis. J. Mol Biol 1991, 222, 739-761. 

(62) Tuerk, C; MacDougal, S.; Gold, L. RNA Pseudoknots that Inhibit 
Human Immunodeficiency Virus Type 1 Reverse Transcriptase. 
Proc. Natl Acad. Sci. U.SA. 1992, 89, 6988-6992. 

(63) Brenner, S.; Lerner, R. A. Encoded Combinatorial Chemistry. Proc. 
Natl. Acad. Sci.U£.A. 1992, 89, 5181-5183. 

(64) Nielsen, J.; Brenner, S.; Janda, K. D. Synthetic Methods for the 
Implementation of Encoded Combinatorial Chemistry. J. Am. 
Chem. Soc. 1993, 116, 9812-9813. 

(65) Arnold, F. H.; Schofield, S. A.; Blanch, H. W. Analytical Affinity 
Chromatography: I. Local Equilibrium Theory and the Measure- 
ment of Association and Inhibition Constants. J. Chromatogr. 
1986, 355, 1-12. 

(66) Arnold, F. H.; Blanch, H. W. Analytical Affinity Chromatography: 
II. Rate Theory and the Measurement of Biological Binding 
Kinetics. J. Chromatogr. 1986, 355, 13-27. 

(67) Zopf,D.;Ohlson,S. Weak-Affinity Chromatography. Nature 1990, 
346, 87-88. 

(68) Holmes, C. P. Polymer Reversal on Solid Surfaces. U.S. Pat. 
5,242,974. 

(69) Li, B.-L.; Longer, J. A.; Schwartz, B.; Petska, S. Creation of 
Phosphorylation Sites in Proteins: Construction of a Phospho- 
rylatable Human Interferon a. Proc. Natl Acad. Sci. UJS.A. 1989, 
86, 558-562. 

(70) Hollenbaugh, D.; Chalupny, N. J.; Aruffo, A. Recombinant 
Globulins: Novel Research Tools and Possible Pharmaceuticals. 
Curr. Opinion Immunol 1992, 4, 216-219. 

(71) ErickBon, J,; Goldstein, B.; Holowka, D.; Baird, B. The Effect of 
Receptor Density on the Forward Rate Constant for Binding of 
Ligands to Cell Surface Receptors. Biophys. J. 1987, 52, 657-662. 

(72) Jayawickreme, C. K.; Graminski, G. F.; Quillan, J. M.; Lerner, M. 
R. Creation and Functional Screening of a Multi-Use Peptide 
Library. Proc. Natl Acad. Sci. U£.A. 1994, 91, 1614-1618. 



Perspective 

(73) Ecker, D. J*; Vickers, T. A.; Hanecak, R; Driver, V.; Anderson, K. 
Rational Screening of Oligonucleotide Combinatorial Libraries for 
Drug Discovery. Nucleic Acids Res. 1993, 21, 1863-1866. 

(74) Blake, J.; Litzi-Davis, L. Evaluation of Peptide Libraries: An 
Iterative Strategy to Analyze the Reactivity of Peptide Mixture 
with Antibodies. Bioconj. Ckem. 1992, 3, 610-613. 

(76) Dooley, C. T.; Chung, N. N.; Schiller, P. W.; Houghten, R. A. 
AcetaUns: Opioid Receptor Antagonists Determined Through the 
Use of Synthetic Peptide Combinatorial Libraries. Proc. Natl, 
Acad. Sci. U.SA. 1993, 90, 10811-10816. 

(76) Houghten, R. A.; Appel, J. R.; Blondelle, S. E.; Cuervo, J. H.; Dooley, 
C. T.; Pinilla, C. The Use of Synthetic Peptide Combinatorial 
Libraries for the Identification of Bioactive Peptides. BioTech- 
niques 1992, 13, 412-121. 

(77) Salmon, S. E.; Lam, K. S.; Lebl, M.; Kandola, A.; Khattri, P. S.; 
Wade, S.; Patek, M.; Kocis, P.; Krchnak, V.; Thorpe, D.; Felder, 
S. Discovery of Biologically Active Peptides in Random Libraries: 



Journal of Medicinal Chemistry, 1994, Vol. 37, No. 10 1401 

Solution-Phase Testing AfterStaged Orthogonal Release from Resin 
Beads. Proc. Natl. Acad. Sci. U.$Ji. 1993, 90, 11706-11712. 

(78) Gordon, E. M.; Godfrey, J. D.; Pluscec, J.; Von Langen, D.; 
Natarajan, S. Design of Peptide Derived Amino Alcohols as 
Transition-State Analog Inhibitors of Angiotensin Converting 
Enzyme. Biochem. Biophys. Res. Commun. 1935, 126, 41^-428. 

(79) Godfrey, J. D.; Gordon, E. M.; Von Langen, D.; Engebrecht, J. 
Synthesis of Peptidyl Derived Amino Alcohols. J. Org, Chem. 1986, 
51, 3073. 

(80) Rich, D. H.; Green, J.; Toth, M. V.; Marshall, G. R.; Kent, S. B. 
H. Hydroxyethylaniine Analogs of the pl7/p24 Substrate Cleavage 
Site are Tight-Binding Inhibitors of HIV Protease. J. Med. Chem. 
1990, 33, 1285-1288. 

' (81) Afrymetrix Co. of Santa Clara, CA, a subsidiary of Affymax, uses 
the VLSIPS process for creation of oligonucleotide libraries on 
chips for the purposes of genetic typing and high- through-put DNA 
sequencing. 



Proc. Natl. Acad. Sci USA EXHIBIT * 

Vol. 92, pp. 6027-6031, June 1995 fcATUDII J 

Pharmacology 



A paradigm for drug discovery employing encoded 
combinatorial libraries 

(medicinal chemistry /isozyme selectivity/combinatorial chemistry) 

Jonathan J. Burbaum*, Michael H. J. Ohlmeyer, John C. Reader, Ian Henderson, Lawrence W. Dillard, 
Ge Li, Troy L. Randle, Nolan H. Sigal, Daniel Chelsky, and John J. Baldwin 

Departments of Biology and Chemistry, Pharmacopeia, Inc., 101 College Road East, Princeton, NJ 08540 

Communicated by Michael H. Wigler, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, February 8, 1995 (received for review 
November 1, 1994) 



ABSTRACT Very large combinatorial libraries of small 
molecules on solid supports can now be synthesized and each 
library element can be identified after synthesis by using 
chemical tags. These tag-encoded libraries are potentially 
useful in drug discovery, and, to test this utility directly, we 
have targeted carbonic anhydrase (carbonate dehydratase; 
carbonate hydro-lyase, EC 4.2.1.1) as a model. Two libraries 
consisting of a total of 7870 members were synthesized, and 
structure-activity relationships based on the structures pre- 
dicted by the tags were derived. Subsequently, an active 
representative of each library was resynthesized {2-[W-(4- 
sulfamoylbenzoyl)-4'-aminocyclohexanespiro]-4-oxo-7-hy- 
droxy-2,3-dihydrobenzopyran and [JV-(4-sulfamoylbenzoyl)-L- 
leucyl] piper idine-3-carboxylic acid} and these compounds 
were shown to have nanomolar dissociation constants (15 and 
4 nM, respectively). In addition, a focused sublibrary of 217 
sulfamoylbenzamides was synthesized and revealed a clear, 
testable structure-activity relationship describing isozyme- 
selective carbonic anhydrase inhibitors. 



The current national focus on health care reform has high- 
lighted the continuing importance of developing more cost- 
effective methods for treating disease. Historically, the phar- 
maceutical industry has contributed to this effort by discov- 
ering and developing therapeutic molecules. The development 
of drug candidates is expensive (1, 2), and preclinical research 
toward the discovery and development of lead structures 
contributes significantly to this expense. For example, in 1991, 
the costs of medicinal chemistry and biological testing prior to 
safety assessment amounted to nearly 30% of domestic re- 
search costs (3). Apart from the economics of drug discovery, 
classical medicinal chemistry cannot efficiently address the 
plethora of new biochemical targets suggested by recent 
discoveries in molecular genetics, and lead structures from 
natural sources are often too complex for the cost-effective 
synthesis of analogs. Hence, targets for therapeutic interven- 
tion cannot be rapidly and efficiently exploited. 

Combinatorial chemical synthesis applied to drug discovery 
promises to improve the productivity of medicinal chemistry 
both by significantly increasing the number of molecules 
available for testing and by providing facile routes toward 
synthetic analogs of active molecules. Many strategies for the 
generation of chemical diversity have been proposed (4-7). 
With a single exception (4), these approaches have been 
confined to the synthesis of flexible oligomeric ligands of 
relatively high molecular weight (e.g., peptides and oligonu- 
cleotides) which tend to be poor therapeutic agents, in part 
because of their lack of availability and stability in vivo. 

For drug discovery, small-molecule libraries containing 
diverse functionality in a variety of molecular scaffolds possess 
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the greatest utility. To construct such structurally diverse 
libraries while allowing the assignment of a structure to each 
member, we use a set of chemically stable molecular tags 
during solid-phase synthesis on polymeric beads (8). These 
tags can be used to unambiguously encode each bead with the 
synthetic scheme for its library member and are sufficiently 
robust to allow a wide range of chemical reactions during 
ligand construction. 

A drug discovery effort based on such combinatorial librar- 
ies must pass several tests to be successful. First, the library 
must be diverse enough to permit the identification of a small 
subset of active molecules among many less active or inactive 
ones. Second, the most promising candidates in the library 
should suggest a testable structure-activity relationship. Third, 
since selectivity for a particular target is often important, 
strategies for the analysis of libraries containing a significant 
fraction of active compounds should result in optimization of 
selectivity as well as activity. 

To explore the capabilities of encoded combinatorial librar- 
ies in a small-molecule drug discovery effort directed toward 
lead identification and optimization, we chose carbonic anhy- 
drase (carbonate dehydratase; carbonate hydro-lyase, EC 
4.2.1.1) as a therapeutically significant model, since inhibitors 
of carbonic anhydrase are known to be useful in ameliorating 
the symptoms of glaucoma. The inhibition of carbonic anhy- 
drase by compounds containing primary sulfamoyl groups 
(-SO2NH2) has been well-characterized, both pharmacologi- 
cally (9-11) and structurally (12-15). This allowed the design 
of test libraries that were predisposed toward showing carbonic 
anhydrase inhibition without explicitly including known inhib- 
itors. Finally, since a number of different forms of the enzyme 
have been characterized, we were able to perform experiments 
to probe isoform selectivity. 

MATERIALS AND METHODS 

Materials. TentaGel S-NH2 resin (particle size, 130 jim) was 
obtained from Rapp Polymere (Tubingen, Germany). The 
photocleavable linker was synthesized as described (16). Car- 
bonic anhydrase was obtained from Sigma. White 96-well 
polystyrene plates were obtained from Dynatech. 

Library Synthesis and Analysis. Libraries were synthesized 
by the split synthesis protocol, and each bead thus contains one 
member of the library (17-19). All members were affixed via 
a photocleavable o-nitrobenzyl linker. To preserve the syn- 
thetic history of each bead, each library was indexed in a binary 
fashion using electrophone tags. These tags were attached by 
using [(CF 3 C02)2Rh]2-activated carbene insertion as de- 
scribed (20). After identification and isolation of beads, the 
tags were analyzed by gas chromatography (8). 
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Synthesis of Dihydrobenzopyran Library. First, three dihy- 
droxyacetophenones (2-acetyl-l,4-dihydroxybenzene, 2-ace- 
tyl-l,5-dihydroxybenzene, and 2-acetyl-6-methyl-l,5-dihy- 
droxybenzene) were coupled using Mitsunobu conditions (tri- 
phenylphosphine/diethylazodicarboxylate in tetrahydrofuran 
at ambient temperature) to the o-nitrobenzyl linker, which was 
then coupled to the beads by carbodiimide coupling [di(iso- 
propylcarbodiimide)/hydroxybenzotriazole in dimethylform- 
amide]. These intermediates were cyclized on the resin with a 
set of seven ketones, four of which contained a protected 
amine functionality. Modification of the amine functionality 
(when present) with 31 different headgroups under standard 
conditions produced a library of ketones containing 381 
members. The ketones were then reduced with NaBH 4 , con- 
verted into the corresponding dithiolanes with ethane-1,2- 
dithiol, or left unaltered to produce a library of 1143 distinct 
compounds. 

Synthesis of Acylpiperidine Library. Library elements were 
coupled as either a carboxylic ester or a carbonate ester of an 
o-nitrobenzyl alcohol. First, five piperidines (2-hydroxy- 
methyl-, 3-hydroxymethyl-, 4-hydroxy-, 4-carboxylic-, and 
3-carboxyIic-), and two linear moieties (5-amino-3-oxopenta- 
nol and 6-aminohexanoic acid) were attached to this linker 
either by phosgene coupling in solution (for the alcohols) or 
directly to the resin (for the acids). Next, 31 amino acids, 
including 28 examples of both D and L isomers of the 20 
commonly occurring a-amino acids, and 3 nonproteinogenic 
amino acids (piperidine-4-carboxylic acid, piperidine-3- 
carboxylic acid, and 6-aminohexanoic acid) were reacted with 
the free amine. Finally, 31 reagents were linked as carboxam- 
ides, carbamates, sulfonamides, and ureas, including three 
primary sulfonamides (4-sulfamoylbenzamido, 4-chIoro-5- 
sulfamoylbenzamido, and 2,4-dichloro-5-sulfamoylbenz- 
amido). 

Library Analysis for Carbonic Anhydrase Inhibition. Beads 
were distributed into 96-well microtiter plates. Compounds were 
detached from the beads by UV irradiation (350 nm) and then 
transferred to assay plates. An assay solution (50 /il) consisting of 
0.1 M phosphate buffer (pH 7.4) containing bovine carbonic 
anhydrase (0.3 jiM) and dansylamide (DNSA; 0.6 /xM) was added 
to each well. Fluorescence values (Aex, 274 run; Ae m , 454 nm) were 
measured with a Perkin-Elmer model LS 50B spectrof luorimeter 
equipped with a microtiter plate reader accessory and were 
normalized (uninhibited, 100; empty, 0). Assay solutions from the 
initial screen identified as active were transferred to a second 
fluorescence plate and 5 /d of 2 mM DNSA in dimethyl sulfoxide 
was added to each well to increase [DNSA] «600-fold in situ. 
Under these conditions, chlorothiazide (IQ « 75 nM) was dis- 
placed, while acetazolamide (Xa **" 7.5 nM) was not, suggesting 
that more potent inhibitors were less sensitive to displacement by 
elevated [DNSA]. 

Selectivity Analysis. Aliquots (one-third) from the same 
bead eluate were assayed against each isozyme. Concentra- 
tions of the two isozymes were matched using eJlo values from 
the literature (21), and the [DNSA] was varied according to the 
literature values for (DNSA) of the isozymes (22). 

RESULTS AND DISCUSSION 

Two model libraries for drug discovery — a 1143-member 
dihydrobenzopyran library and a 6727-member acylpiperidine 
library — were designed. The first (dihydrobenzopyran) library 
(Fig. \A) was designed to include 4-sulfamoylbenzoyl amides 
that were expected to confer carbonic anhydrase activity. The 
second (acylpiperidine) library (Fig. IB) was designed around 
a core aminocarboxamide derived from 31 amino acids, in- 
cluding nonproteinogenic amino acids and both the D and L 
configurations of several naturally occurring a-amino acids. 
The carboxamide was formed from the carboxylic acid with 



seven amines including four substituted piperidines. Three 
different sulfamoylbenzoyl groups were incorporated at the 
final step to confer carbonic anhydrase activity. 

Activity of library members against bovine carbonic anhy- 
drase was assessed by using a fluorescence-based ligand- 
displacement assay (9). Because each synthetic library con- 
tained >1000 beads per member, we characterized the activity 
of the library statistically to optimize our screening conditions. 
The dihydrobenzopyran library was determined to contain 
~1.4% actives, identified as those wells where the fluorescence 
decreased by >5<r. Similarly, in the acylpiperidine library, we 
determined that the samples contained ~6% actives. These 
observations were consistent with the percentage of primary 
sulfonamides incorporated in the two libraries, which was 
predicted to be 3% and 10% for the dihydrobenzopyran and 
acylpiperidine libraries, respectively. 

Over 2300 beads from the dihydrobenzopyran library were 
then assayed singly and 33 individual beads were chosen for 
decoding. Thus, 2.0 library equivalents were assayed, where 
one library equivalent is defined as the number of beads equal 
to the number of distinct library members. The probability that 
a given compound was not assayed is approximated as follows: 
In a library of N total beads » M, the total number of library 
members, when S beads have been assayed, the probability that 
a given member has not been chosen is given by 

which is approximately {e~ x Y /M for Iar g e The quantity S/M 
is L y the number of library equivalents that have been assayed, 
so P s (0) e~ L . For the case at hand, the fraction of the library 
assayed at least once is (1 - e~ 2 ) or 86%. 

The larger acylpiperidine library was pooled at the final 
reaction step into two smaller portions to permit a, more, 
convergent assay strategy. These pools consisted of 3472 and 
3255 members, the larger portion containing all the primary 
sulfonamides. For the smaller portion, *»5 library equivalents 
(— 17,000 beads) were assayed at 10 beads per well, and no 
actives were identified. Thus, in Eq. 1,L « 5 and P s (0) is <0.01. 
Hence, >99% of the members were assayed at least once, and 
none were found to be active under the conditions described. 

We therefore focused our continued efforts on the portion 
of the acylpiperidine library that contained primary sulfon- 
amides. Here, half the eluates from a total of 4320 single beads 
(1.3 library equivalents) were assayed individually, and >300 
actives were identified (Fig. 2A). To estimate the relative 
potency of the active inhibitors, the stringency of the assay was 
increased by raising the concentration of the competitor, 
DNSA. However, the associated background fluorescence 
made the fluorescence decrease caused by inhibitors less easily 
detected. Potent actives were therefore judged as differing by 
>3<7 compared to the median active compound. Eighteen 
high-affinity actives were thereby identified and decoded (Fig. 
2B). 

As anticipated, we found that carbonic anhydrase inhibitors 
from either library exclusively contained the sulfamoyl group 
(-SO2NH2). From the actives, we chose two synthetic targets, 
one from the dihydrobenzopyran library (compound 1) and 
one of the more potent actives from the acylpiperidine library 
(compound 2). These compounds were resynthesized, their 
structures were confirmed spectroscopically, and their K d 
values vs. carbonic anhydrase were determined experimentally 
(Table 1). The low nanomolar values for compounds 1 and 2 
support the use of tag-derived information for structure- 
activity assessments in the libraries we report. 

Significantly, no structures containing a 4-chloro-5-sulfa- 
moylbenzoyl moiety were found among the tag-predicted 
structures for potent actives in the acylpiperidine library. 
Furthermore, while the other two sulfamoylbenzamides were 



Pharmacology: Burbaum et at Proc. Natl Acad. Set USA 92 (1995) 6029 



A Dihydrobenzopyran Library B Acylpiperidine Library 




Tag-Encoding Strategy 



> a 




Bead 



+ Bead [fTFA k Rh] g, 




Encoding ") 



Bead 



Decoding^ 



°^XX7 T *° (NH4)2C6(N03)2 » Tag-OH 

well-represented, they showed markedly different structure- 
activity relationships. For example, the 4-sulfamoylbenz- 
amides appear to prefer a hydrophobic R2 group of the L 
configuration, while the 2,4-dichloro-5-sulfamoylbenzamides 
appear to tolerate more diversity at R 2 , including D-alanine. 
Thus, to test the validity of this observation, library-specific 
homologs of the acylpiperidine (compound 2) were synthe- 
sized (Table 1, compounds 3 and 4). The 150-fold increase in 
K4 seen upon substitution further validates the ability of the 
screen to select active compounds and supports the divergent 
structure-activity relationships observed with the various sul- 
fonamide headgroups. 

In developing a class of compounds for use as therapeutic 
agents, increased affinity is but one criterion for optimization. 
Isozyme selectivity is often an additional critical property 
required to minimize side effects. The selectivity of a large 
library is difficult to optimize convergently (i.e., with >1 
element per assay), because such assays must measure the 



Fig. 1. Synthesis and tagging of an encoded com- 
binatorial library. Library members were synthesized in 
parallel on polymeric beads by a split synthesis protocol 
(17-19). Polyethylene glycol -grafted polystyrene beads 
(TentaGel S-NH 2 ; Rapp Polymere, Tubingen, Ger- 
many) were modified to include a photocleavable o- 
nitrobenzyl linker. General structures and examples of 
structures are illustrated for the dihydrobenzopyran 
library (4) and the acylpiperidine library (£). An out- 
line of the binary tagging method is also shown (C). 
TFA, trif luoroacetate. 



absence of activity against a particular target or relative levels 
of activity vs. multiple targets. As the percentage of actives 
increases, the probability that the observed affinities are the 
property of multiple elements increases. Thus, approaches 
taken to identify high-affinity peptides in a convergent fashion 
(e.g., see ref. 6) become impractical with selectivity, and 
focused sublibraries assayed singly become pertinent. To eval- 
uate the potential for using encoded combinatorial libraries to 
optimize selectivity, a 217-member sublibrary (consisting of 
the 4-sulfamoylbenzamides from the larger acylpiperidine 
library) was prepared and evaluated against human carbonic 
anhydrase isozymes I and II (CA(I) and CA(II)]. 

To isolate selectivity as a variable, two factors were nor- 
malized. First, equal inhibitor concentrations were exposed to 
each isozyme by taking aliquots from the same bead eluate. 
Thus, one-third of a complete eluate was assayed vs. human 
CA(I), and one-third was assayed vs. CA(II). Second, the 
concentrations of the two isozymes were matched, but the 
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Fig. 3A shows a selectivity plot comparing the normalized 
fluorescence values for the two enzymes. In this figure, region 
A contains compounds that show potency vs. both isozymes, 
while region B contains compounds that are less selective for 
CA(II) than the median, and region C contains compounds 
that are more selective than the median. Sulfonamides that are 
not particularly active against either isozyme are found in 
region D. Beads from each region were decoded, and examples 
of the structures predicted by these codes are shown in Fig. IB. 
The potent compound that was identified in screening against 
bovine carbonic anhydrase (Table 1, compound 2) was iden- 



Normalized F 

Fig. 2. • Histogram analysis of combinatorial libraries. (A) Low- 
stringency library analysis. Half the eluates from a total of 4320 single 
beads (1.2 library equivalents) were assayed individually, and >300 
actives (5<r) were identified. For the dihydrobenzopyran library, 
>2300 members (2.0 library equivalents) were assayed singly and 33 
individual beads were identified and decoded. (B) Reanalysis of 
positives from>l at higher stringency. To increase the stringency of the 
assay, the concentration of DNSA was increased ^600-fold in situ, and 
18 potent actives were selected for decoding. 

[DNSA] was varied according to the literature values for 
^dnsa of the isozymes. These requirements derive from the 
binding equation, given relative to isozyme x 



F x « [E^-DNSA] - 



[E] 



tot,* 



-KpNSA,* / [I] \ 

[DNSAlA Kj 



[2] 



1 + 



Thus, when E tot is equal, the ratio [DNSAJ/ATdnsa is equal, and 
[I] is equal, the only variable that can affect F is the dissociation 
constant for the inhibitor, Ki. To identify samples for which 
there is a significant difference in K\ for the two isozymes, the 
normalized fluorescence values were analyzed graphically 
(Fig. 3). 

Table 1. Analysis of compounds and validation of tag-derived 
structure-activity relationships 
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Kd[bCA(II)l, 
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Structures of compounds 1-4 were confirmed analytically by ! H 
NMR spectroscopy and mass spectrometry. K4 values were measured 
as described (7) with excel software to solve the multiple equilibria 
numerically, b, Bovine. 
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Fig. 3. Selectivity assay of a highly active sublibrary. (A) Inhibitors 
were analyzed independently for each isozyme as in Fig. 2 and then 
plotted. The observation that the majority of the library members are 
found above the diagonal suggests that the selectivity for hCA(II) may 
be a property of this class of compounds rather than a characteristic 
of a given member. (B) From tag-derived relationships, there are clear 
trends associated with the different regions, and some examples of 
compounds are shown. The characteristics of the potent inhibitors 
(region A) are described above, and the potent compound that was 
identified in assays against bovine carbonic anhydrase (Table 1, 
compound 2) was found here. Region B (relatively nonselective) shows 
a preference for cyclic imino acids in the central region, which may 
reflect differential hydrogen bonding by the carboxamido nitrogen of 
the sulfamoylbenzamide (13) or to a positive discrimination by the ring. 
Region C [relatively hCA(II) selective] shows a preponderance of 
polar side chains, as well as a relative lack of sensitivity toward the C a 
configuration. Region D (low activity) contains more polar com- 
pounds but, notably, one D-leucyl side chain that complements the 
strong preference for hydrophobic groups of the L stereoconfiguration 
noted above. 
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tified again in region A. Analysis of the selectivity of this 
compound showed that it has a 3-fold selectivity for human (h) 
CA(II) {X d [hCA(II)] = 11 nM;tf d [hCA(I)] = 33 nM}, but this 
fact would not be apparent from the plot because of its 
relatively high affinity. Since the other compounds in this small 
sublibrary are also analogs of 4-sulfamoylbenzamide, the ob- 
servation that most library members are found above the 
diagonal suggests that the selectivity observed for compound 
2 may be a property of this class of compounds rather than a 
characteristic of this particular member. Nevertheless, there 
are clear trends associated with the different regions. Region 
B (relatively nonselective) shows a preference for cyclic imino 
acids in the central region, which may reflect differential 
hydrogen bonding by the carboxamido nitrogen of the sulfa- 
moylbenzamide (13), or to a positive discrimination by the ring. 
Region C [relatively hCA(II) selective] shows a preponderance 
of polar side chains as well as a relative lack of sensitivity 
toward the C a configuration. Region D (low activity) contains 
more polar compounds but, notably, one D-leucyl side chain 
that complements the strong preference for hydrophobic 
groups of the Lstereoconfiguration noted above. Together, the 
trends derived from library analysis suggest that more selective 
compounds could be synthesized, either as single compounds 
or as appropriately designed focused libraries. 

Conclusions. We have synthesized encoded libraries con- 
taining 10 3 — 10 4 small molecules by using diverse chemistries. 
Active carbonic anhydrase inhibitors within these libraries 
have been identified and optimized by using directed subli- 
braries based on the original lead and the chemistry developed 
for its synthesis. This application of combinatorial libraries for 
drug discovery addresses two pressing needs of the pharma- 
ceutical industry. First, small-molecule leads will become more 
readily available as the size and diversity of synthetic libraries 
increases. This, in turn, will accelerate the discovery of com- 
pounds active against a variety of therapeutic targets, espe- 
cially those that emerge from the explosion in sequence 
information arising from the human genome project. Second, 
synthetic chemists involved in drug discovery will be more 
efficiently engaged to solve sophisticated structure-activity 
problems. The success of the combinatorial approach empha- 
sizes the power of techniques for the rapid synthesis and 
evaluation of large numbers of possible structures and antic- 
ipates a new age of drug discovery and optimization. In the 
future, analysis of combinatorial libraries may be used to direct 
research efforts toward productive avenues by suggesting the 
synthesis either of a handful of pertinent test compounds or of 
combinatorial libraries based on relevant structural types. 
Indeed, such a combinatorial approach may be productively 
applied not just in pharmaceutical discovery, but in any area 
where optimization of molecular properties is necessary. 
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ABSTRACT We report a general mass spectrometric 
approach for the rapid identification and characterization of 
proteins isolated by preparative two-dimensional polyacryl- 
amide gel electrophoresis. This method possesses the inherent 
power to detect and structurally characterize covalent modi- 
fications.. Absolute sensitivities of matrix-assisted laser de-. 
sorption ionization and high-energy collision-induced disso- 
ciation tandem mass spectrometry are exploited to determine 
- the mass and sequence of subpicomole sample quantities of 
tryptic peptides. These data permit mass matching and se- 
quence homology searching of computerized peptide mass and 
protein sequence data bases for known proteins and design of 
oligonucleotide probes for cloning unknown proteins. We have 
identified 11 proteins in lysates of human A375 melanoma 
cells, including: a-enolase, cytokeratin, stathmin, protein 
disulfide isomerase, tropomyosin, Cu/Zn superoxide dis- 
mutase, nucleoside diphosphate kinase A, galaptin; and tri- 
osephosphate isomerase. We have characterized several post- 
translationaJ modifications and chemical modifications that 
may result from electrophoresis or subsequent sample pro- 
cessing steps. Detection of comigrating and coyaiently modi- 
fied proteins illustrates the necessity of peptide sequencing 
and the advantages of tandem mass spectrometry to reliably 
and unambiguously establish the identity of each protein. This 
technology paves the way for studies of cell-type dependent 
gene expression and studies of large suites of cellular proteins 
with unprecedented speed and rigor to provide information 
complementary to the ongoing Human Genome Project. 

Cloning of genes associated with malignancy generates inev- 
itable excitement in biology and medicine. However, subse- 
quent study at the protein level is clearly necessary to under- 
stand the processes by which those genes affect vital biological 
functions, such as the control of gene expression and regula- 
tion of cell-signaling pathways. Two-dimensional (2D) PAGE 
is preferred for simultaneous, separation and visualization of 
proteins present in cell lysates (1, 2). Protein identification and 
characterization of functionally important primary structural 
features are . the first steps toward gaining insight into the 
biological roles of specific proteins (3-6). Furthermore, char- 
acterization of the 2D-PAGE map from a cell system facilitates 
both basic research and clinical diagnosis (3, 7). 

Traditional partial-sequencing approaches for identifying 
proteins isolated by 2D PAGE, such as Edman degradation of 
electroblotted proteins, often meet with limited success due to 
N-terminal blockage of many eukaryotic proteins. While pep- 
tide cleavage, extraction, and HPLC separation may enabie 
subsequent Edman sequencing of internal peptides (8), these 
procedures alone are not rapid enough to identify the thou- 
sands of proteins in a cell lysate in a timely manner. Because 
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of their speed, sensitivity, and ability to deal directly with 
mixtures, several recently developed mass spectrometry tech- 
niques are rapidly becoming the primary methods for identi- 
fying, proteins isolated by 2D PAGE (4-6, 9, 10). Previously, 
our laboratories reported the advantages of using liquid sec- 
ondary ion mass spectrometry, high-energy collision- induced 
dissociation (CID) tandem mass spectrometry, Edman se- 
quencing, and 2D PAGE .to characterize lipo'cortin I from 
human melanoma lysates (4). Peptide mass determination and 
CID sequencing, using sample quantities of ***100 pmol re- 
vealed an acetylated N terminus and an unanticipated acryi- 
amide-modified cysteine. 

Recently, we have substantially reduced the detection limits 
of peptide sequencing through incorporation of continuous, 
flow sample introduction and a scanning charge -coupled de- 
vice array detector onto our tandem mass spectrometer. The 
resulting chemical noise reduction and rapid recording of 
single CID spectra (~10 s) now enable routine peptide se- 
quencing from sample quantities of 100 fmol to 10 pmol (11, 
12). Prior to CID, substantial sample is conserved by exploiting, 
the 1-100 fmol sensitivity of matrix- assisted laser desorption 
ionization (MALDI), rather than liquid secondary ion mass, 
spectrometry for preliminary mass measurement. MALDI 
enables the generation of peptide molecular-mass maps from 
fmol levels of unfractionated protein digests or individual 
HPLC fractions. These peptide-mass fingerprints may be used 
to search peptide-mass data bases and predict protein identi- 
ties without resorting to sequencing (5, 9, 13). However, 
mass-matching algorithms can find only proteins which are 
already present in a data base. Obviously, primary structure 
elucidation is essential to identify unknown proteins, search 
sequence data. bases for homologous proteins, characterize 
covalent modifications,, or design oligonucleotide probes for 
gene cloning. Taking these points into consideration, we report . 
an integrated strategy for identifying and characterizing pri- 
mary structural features of proteins isolated by 2D PAGE.. 

MATERIALS AND METHODS 
Protein Isolation, Purification, Digestion, and Edman Se- 
quencing. Preparation of human A375 melanoma lysates, 
isolation of proteins by 2D PAGE (4, 14), protein electroeiu- 
tion of pooled gel spots, tryptic digestion, HPLC separation, 
and Edman sequencing were performed as described (4), The 
procedure of Rosenfeld^ra. (15) was used to produce an in -gel 
tryptic digest of spot 42. Control digestions without protein 
substrate were performed so that trypsin autoproteolysis prod- 
ucts could be disregarded. 

Abbreviations: 2D, two-dimensional; CID, collision-induced dissoci - 
ation;. MALDI, matrix-assisted laser desorption ionization. 
^Present address: Department of Chemistry, Illinois State University, 
Normal, IL 61790. 

*To whom reprint requests should be sent at the t address. 
"To whom reprint requests should be sent at die * address. 
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Mass Spectrometry. Molecular masses (iso^opic average) of 
all tryptic peptides were determined by analyzing l/50th of 

. each HPLC fraction with a VG TofSpe- MALDI mass spec- 
trometer equipped with a nitrogen ^ser and operated in the 
linear mode. Peptides were crystallized in matrices consisting 
of 100 mM 2,4-dihydroxybeneoic acid/50 mM fucose or a 
saturated solution of a-cyr-rio-4-hydroxycinnarnic acid pre- 
pared' in 0,1% aqueous trifl'-ioroace tic acid (TFA). The mean 
mass from all spectra recorded for a particular peptide is 
reported. All MALDI spectra were externally calibrated by 
using a standard peptide mixture. High-energy positive ion 
C1D mass spectra were acquired on a Kratos Analytical 
Instruments Concept 'IIHH four-sector tandem mass spec- 
trometer equipped vith a continuous flow, liquid inlet probe 
and a scanning charge^coupled device array detector (11). 
Both MSI and MS2 were operated at 1000 resolution (M/A/n) 
to determine monoisotopic masses. HPLC fractions were 
concentrated to **5 ju.1 and dilured to **15 juJ with a mixture of 
aqueous 5% (vol/vol) thioglycerol/5% (vol/vol) acetonitrile/ 

.0.1% TFA matrix solution. Samples were introduced into the 
mass spectrometer source at a flow rate of 3 jxl/min. Sequences 
were deduced from CID spectra by using interactive interpre- 
tation software developed in our laboratory (16). Quantitation 
estimates were based on CID and UV response of standards. 

Immunoblot Analysis. 2D immunoblot analysis was per- 
formed as described (4) with modifications. Gels were loaded 
with 400-800 ^g of protein, lysate and. transferred at 13 

. mA/cm 2 for 2 h, and blots were stained with 0.1% Ponceau 
S/5% (vol/vol) acetic acid. Stain was removed with 100 mM 
NaOH prior -to .antibody staining (data not shown). 

Data Base Searching. The OWL protein sequence data base 
(17) was searched by using blast (18) with peptide' sequences 
obtained by high-energy CID analysis: or Edman degradation. 
Masses obtained by MALDI were used by MOWSE (internet 
e-mail server version 5.1; mowse@dl.ac.uk) to search a peptide - 
mass data base constructed from a theoretical trypsin digest of 
the entire OWL data base (9). Typical parameters employed were 
a 15% gel-derived protein-mass tolerance, a 2- or 3-Da pep tide - 
mass tolerance, and a partial cleavage scoring factor of 0.4. 

RESULTS AND DISCUSSION 

Fig. 1 illustrates a representative 2D preparative gel containing 
human A375 melanoma proteins from whole cell lysates. The 
spots selected for mass spectrometry analysis were chosen be- 
cause of their abundance, reproducibility, and relative isolation. 
Our strategy for the identification and characterization of these 
proteins is outlined in Fig. 2. Protein spots from several prepar- 
ative gels were excised, and spots of identical mass pi were pooled 
before purification from the gel matrix, digestion with trypsin, 
and peptide separation by HPLC. The yield, and number of 
peptides recovered from each spot were not always proportional 
to the number of gel plugs excised or the amount and size of 
protein present, suggesting differential digestion susceptibility, - 
sample handling loss, and occasional protein comigration. For 
each spot, sub-pmol aliquots of HPLC fractions were analyzed by 
MALDI. The list of masses obtained from each protein digest 
served as the focal point for guiding subsequent experiments for 
identifying and characterizing primary structural features in the 
protein(s). The number of peptides recovered from digestion of 
a protein in a 2D PAGE spot varied and depended on protein size 
and the occasional comigration of other proteins. Experimentally 
determined masses were used with MOWSE data-base searching 
to match theoretical peptide masses and attempt to predict 
protein identities. HPLC fractions -containing peptide masses 
below 2 kDa were subjected to high-energy CID for peptide 
sequencing. Protein identities were established by using the 
determined sequences to search the OWL protein sequence data 
base by using blast. Edman degradation was used to sequence 
peptides with masses greater than ~2 kDa, since larger peptides 
ionize less efficiently by liquid secondary ion mass spectrometry. 




514 SIS £2 
Isoelectric point, pi 

.. Fig. 1.. Coomassie blue G-250-stained 2D preparative gel of 
human A375 melanoma proteins, numbered as in Tables 1 and 2. One 
milligram of total protein was loaded. The amount of protein -esti- 
mated by densitometry in designated spots varied from 0.6 to 2.9 p,g. 

" Tables 1 and 2 summarize peptide mass data, sequences deter- 
mined or attributed by mass, and data-base search results for all 
spots studied. Ideally, we seek to attribute every mass to a unique 
sequence either by .sequencing each peptide or matching it by 
mass to a peptide from the characterized protein. 

From the 39 peptide masses from spot 3 listed in Table 1, 
MOWSE predicted the protein identity as a-enolase. Although 
most cell types demonstrate a-enolase activity, distribution of the 
specific enolase isoenzymes in biologically active dimers may be 



Remove protein(s) from gel spot by 
electro-elution* in-gel digestion, or electro-blotting 



Enzymatically digest ail protein(s) from spot 



Analyze 
next spot 



Separate peptides by HPLC 



Determine mass of each oe pride by 
MALDI MS or LOESIMS 



>2000 Da 




design oligo- 
nucleotides 
for cloning 



Determine intact masses 
of protein(s) by 
ESI or MALDI MS 



Fig. 2. Strategy for identifying and characterizing proteins sepa- 
rated by 2D PAGE. ESIMS, etectrospray ionization mass speorometry. 



) 



5074 Biochemistry: Clauser et at 



Proc. NatL Acad ScL USA 92 (1995) 



tissue specific. Only the a- and y-homodimers can be detected in 
■ serum from patients with malignant ocular melanoma (19). 
During sequence analysis on selected peptides from spot 3, we 
also found a cytokeratin sequence LASYLDK that is present in 
many type I cytoskeletal keratins (mass/pi and peptide masses 
best matched cytokeratin 15). This indicated that at least two 
proteins had comigrated. Immunobiot analysis confirmed that 
a-enolase is present. Furthermore, the presence of a cytokeratin 
is consistent with the localization of many keratin subtypes to this 
gel region (20). However, the presence of cytokeratin was not 
predicted by MOWSE when masses exclusive of those attribut- 
able to a-enolase were used. 

Protein disulfide isomerase (PDI) and stathmin were found in 
spots 30 and 24, respectively, and MOWSE readily predicted 
both. Sequencing of several peptides by high-energy QD estab- 
lished these identities. PDI, which catalyzes formation and inter- 



conversion of disulfide bonds in the endoplasmic reticulum, has 
been implicated in the activation of interferon- inducible genes in 
chronic myelogenous leukemia cells (21), Immunobiot analysis 
confirmed the presence of PDI in spot 30 and showed stathmin 
presence in spot 24 and in two nearby (more acidic) spots. 
Serine-phosphorylated forms of stathmin have been found in 
T-lymphocytes (22). The ability to suggest additional posttrans- 
lationally modified or genetically variant isoforms of a protein was 
one motivation for incorporating immunobiot analysis into our 
strategy. 

In spot 33 fibroblast nonmuscie tropomyosin was identified 
following high-energy CID. Similarly, another form of tropo- 
myosin, cytoskeletal type, was identified, in spot 34. The two 
protein sequences are 76% identical. High-energy CID showed 
that the masses of the N-terminal peptides from both proteins 
differed by only 1 Da, due to an internal sequence difference 



Table 1. Summary of data obtained for nine human melanoma proteins from eight 2D PAGE gel spots 
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1583.8 


3.0 




U lO£j t * [ Jf. AL/af J.OJ 


oO/.j 


no 




1239.1 


1.9 




(humplcalfl.gb_pr) 


790.0 


2.1 


(K)FLDAGHK(L) 


1192.1 


-0.2 






1459.1 


1415.3* 




998.2 


2.0 


33 


Tropomyosin, 


1241.1 


-3.3 


(R)IQLVEELDR(A) 


1740.3 


-3.1 




fibroblast non-muscle type 


. 1193.3 


-1.7 


(M)acAGLNSLEAVKR(K) 


1614.7 


-1.0 




P07226 [30 kDa/4.5] 


721.0 


-1.8 


(R)AEFA£R(S) 


1297.5 


-2.1 




(tpmg_human.swiss)' 


1690.4 


1492.7 


1427.7 1410.5 1253.2 1192.6* 


1272.1 


-1.3 






1023.7 


893.0 


799.3 762.1 696.6 689.8* 


1170.0 
1168.6 


-2.3 
-2.3 




. Tropomyosin, 








1014.7 


-1.4 


34 


1235.1 


-0.4 


" (R)KLVltEGDLER(T) 


1773.1 


1.2 




cytoskeletal type 


1202.0 


- 0.4 


(M)acAGITTIEAVKR(K) 


1726.7 


-2.2 




P12324 [29 kDa/4.6] 


1155.2 


-22 


(K)LVIIEGDLER(T) 


1670.2 


-17 




(tpmi_human.swiss) 


1044.4 


-1.0 


(M)acAGrTTIEAVK(R) 


1471.4 


-2.2 






329.1 


-1.9 


(K)ILTDKLK(E) 


1380.9 


-0,7- 






774.3 


-L5 


(R)AELAESR(C) 


1241.2 


-3.2 






743,1 


-1.3 


(R)LATALQK(L) 


1130.4 


-2.0 






1958:6 


1697.8 


1481.0 1395.9 1295.9 1282.3* 


921.5 


0.5 






1255.1 


1168.2 


1148.2 1073.1 934.3 372,0* 


' 873.7 


-1.2 


35 


Cu/2!ji superoxide dismutasc 


822.8 


'.-3.2 


(R)TLWHEXM) 


1023.5 


-1.6 




P00441 [18 kDa/5.7] 


638.1 


-1.7 


(K)VWGSIK(G) 


771.7 


-2.1 




(not predicted) 1 


2230.4' 


1972.6 


1498.5 1151.3 1127.6 U01.2 1039.3* 


662.3 


0.1 


36 


Nucleoside diphosphate 
kinase A 


1344.3 
1149.2 


-1.3 
-1.2 


(R)TF1A IKPDGVQR(G) 
(K)DRPFFAGLVK(Y) 


1193.1 


-0.3 




P15531 [18 kDa/5.8] 


1065.1 


-1.1 


(R)GDFCamiQVGR(N) 








(ndka_human.swiss)* 


1800.7 


825.6 


656.4* 






41 


Galaptin 


1661.3. 


-1.5 


(R)FNAHGDANTIVCamNSK(D) 


1545.2 


0.6 




P09382 [16kDa/5.1] 


15012 


0.6 


(K)DSNNLCamLHFNPR(F) 


1075.5 


-1.5 




(not predicted)' 


377.3 


-0.8 


(K)SFVLNLGK(D) 


969,0 


-0.1 






2400.1 


2210.1 


2067.3 1991.4 1950.1 1940.4 1849.8* 


909.5 


-0.5 






1822.0 


1803.4 


1781.1 1680.1 152S.3 1516.4* 







Peptide sequence consistent 
with mass* 

(R)EAMRIGAEVYHNLK(N) 

(K)WIGMDVAASEFFR(S) 

(R)YISPDQLADLYK(S) 

(K)GVSKAVEHINK(T) 

(R)IGAEVYHNLK(N) 

(R)IEEELGSK(A) 

(K)TIAPALVSK(K) 

(K)GVPLYR(H) 

(R)VLAEMREQYEAMAEK(N) 

(K)EVASNTEMIQTSK(T) 

(R)EQYEAMAEK(N) 

(R)ASGQAFEULSPR(S) 

(K)ESVPEFPLSPPK(K) 

(K)AIEENNNFSK(M) 

(R)KSHEAEVLK(Q) 

(R)EATNPPVIQEEKFK(K) 

(R) D G EE AG A YDG PR (T) 

( RJLAPEYEA A ATR(L) 

(K)QAGPASVPLR(T) 

(R)KIQALQQQADEAEDR(A) 

(K) fQALQQQADEAEDR(A) 

(R)KLVILEGELER(A) 

(R)EKAEGDVAALNR(R) 

(K)AEGDVAALNRR('I) 

(K) L V I LEG ELER( A) 

(K)AEGDVAALNR(R) 

(R)KJQVLQQQADDAEER(A) 

(R)IQLVEEELDRAQERfL) 

(K)LVIIEGDLERTEER(A) 

(R)EQAEAEVASLNRR(I) 

(K)YEEEIKILTDK(L) 

(R)IQLVEEELDR(A) 

(K)MELQEIQLK(E) 

(R)EMDEQIR(L) 

(R)EVEGERR(A) 

(K)HGGPKDEER(H) 

(K)crESNGPVK(V) 

(K)TGNAGSR(L) 

(K)crD RPFFAGLVTC(Y) 



(K)crDSNNLCamLHFNPR(F) 
(K)DGGAWGTEQR(E) 
(K)LPDGYEFK(F) 
(K)crlKCam VAFD( ) 

i.3 1516.49 

'The difference between the measured mass and calculated mass (average isotopic). 

* Abbreviations: ( ), residues before/after peptide; Cam, acrylamide-modified Cys: Camo, Cam (oxidized); acX, acetyiated N terminus; crX, carbamyiated N terminus; 
Mso, Met sulfoxide. 
^Sequenced by Edman degradation. 
*Masses aeither identified aor attributed. 
'Masses ambiguous: a-enotase or cytokeratin. 
'inconclusive search result. 



* Biochemistry: Causer et at p rQC tf at i ^cad Set USA 92 (1995) 5075 

Tabic 2, Summary of data obtained for triosephosphate isomcrase from 2D PAGE gel spots 37 and 42 

Protein idenrified 
OWL accession 



Spot no. 



no. [Mass/pi] 
(MOWSE search 
■ prediction) 


MALDI 
mass, 
Da 


Da 


Peptide sequence determined by 
high-energy CID* 


MALDI 
mass, 
Da 


A". 

Da 


Triosephosphate 


1647.7 


3.0 


(K)crDCamGATWWLGHSER(R) 


3026.6 


-4.3 


isomcrase 


1620.9 


-2.0 


(K)VTNGAFTGEISPGMIK(D) 


2208.2 


0.7 


P00938 [25 


1615.6 


-0.2 


.( R) RHV FGESD ELI GQK(V) 


IS36.S 


-2.4 


•kDa/7.0j 


1599.2 


-2.6 


(K)DCamGATWVVLGHSER(R) 


1320.6 


-2.6 


(tpis_humaa.swiss) 


1595.1 


-8.8 


(K)WLAYEPVWAIGTGK(T) 


1638.9 


0.0 




1459.0 


-0.6 


(R)HVFGESDELIGQK(V) 


1607.5 


1.6 




1408.7 


-6.9 


(K)QSLGELIGTLi^AAK(V) 


1591.5 


0.6 




1330.9 


-3.7 


(R)crllYGGSVTGATCamK(E) 


1503.3 


1.2 




1231.2 


1.3 


(K)DCamGATWWLGH(S) 


1468.7 


1.1 




1196.1 


0.7 


(K)crlAVAAQNCamYK(V) 


1341.3 


-0.3 




1163.7 


0.3 


(K)lAVAAQNCaraoYK(V) 


1278.2 


2.7 




956.4 


1.3 


(K)FFVGGNWK(M) 


1234.4 


-0.9 




S92.5 


-1.5 


(K)crWFEQTnC(V) 


1151.2 


-1.2 




758.9 


0.0 


(K)VIADNVK(D) 


1123.1 


-3.2 




1747.1 


1643.5 


1481.4 1469.8 1109.4 826.5§ 


1081.9 


-1.4 


Triosephosphate 


1760.1 


2.1 


(K)DCamGATWWLGHSERR(H) 


3035.3 


3.9 


isomerase 


1602.6 


0.8 


(K)DCamGATWWLGHSER(R) 


2342.9 


6.2 


P00938 [25 


1407.7 


-7.9 


.CK)QSLGELIGTLNAAK(V) ... 


. 2203.9 


-3.6 


kJDa/6.4] 


1340.1 


-1.5 


(R)IIYGGSVTGATCaniK(E) • 


1826.4 


3.2 


(tpisjiu man. Swiss) 


1235.7 


0.4 


' (K)SNVSDAVAQSTR(I) 


1615.6 


-0.2 




953.1 


-2.0 


(K)FFVGGNWK(M) 


1604.7 


0.8 




2210.7 


2089.8 


2034.2 1823.4 1594.4 1400.3$ 


1543.4 


-0.4 




1063.0 


1060.45 




1467.1 


-0.5 










1457.9 


-1.7 










1151.5 


-0.9 










1082.6 


-0.7 










757.3 


-1.1 










, 747.0 


-0.8 



Peptide sequence consistent with mass* 



42 



( K) E LASQ P D VDG FL VG G AS LKP EFVD UN AK(Q ) 

(K)VPADTEVVCamAPPTAYIDFAR(Q) 

(K)VAHALAEGLGVIACamoiGEK(L) 

(KJVAHALAEGLGVlACamlGEKfL) 

{K)VTNGAFTGEISPGiMso[K(V)* ' 

(K)LDPKlAVAAQNCamYK(V) 

(K)VVFEQTKVIADNVK(D) 

(R)crHVFGESDELIGQK(V) 

(K)TATPQQAQEVHEK(L) 

(R)IIYGGSVTGATCamK(E) 

(K)VTADNVKDWSK(V) 

(K)SNVSDAVAQSTR(I) 

(K) I A V A AQNCam YK( V) 

(R)ctKFFVGGNWK(M) 

(R)KFFVGGNWK(M) 

(K)ELASQPDVDGFLVGGASLKPEFVDHNAK(Q)- 

( K) V AH A LAEGLG VI ACa m IG EKLD ER (E) * 

(K) YPADTEVV CamAPPTAYID FAR(Q)* 

(K) VAH A IAEGLG VI ACamIGEK(L)* 

(R)RHVFGESDELiGQK(V) 

(K)WLAYEPVWAIGTGK(T)t 

(R)KQS LGELIGTLNAAK( V) 

(K)TATPQQAQEVHEK(L) 

(R)HVFGESDELIGQ'K(V) 

(K)lAVAAQNCamYX(V) 

(R)KFFVGGNWK(M) 

CK)VIADNVK(D) 

(R)EAGITEK(V) 



"The difference between the measured mass and calculated mass (average iso topic). 

Abbreviations: ( ), residues before/after peptide; Cam, acrylamide -modified Cys; Camo, Cam (oxidized); acX, acetyiated N terminus; crX. carbamylated N terminus; 

Mso, Met sulfoxide, 
^Sequenced by Edman degradation. 
^Masses neither identified nor attributed. 
Masses ambiguous: a^enolase or cytokeratin. 
• ^Inconclusive search result 



of ITTI (spot 34) vs. LNSL (spot 33). The presence of 
N-terminal acetylation is evident in Fig. 3 from the 42-Da 
increase in the mass of ail N-terminally derived a- and 6-type 
ions. Leucine at residues 3 and 6 is evident in Fig. 3 from the 
mass values of the w6 (656.3 Da) and w9 (970.4 Da) ions 
resulting from side-chain fragmentation. Isoleucine was like- 
wise assigned at those same positions in the corresponding 
. peptide from spot 34 (data not shown). 

Differentiation .and unambiguous structure assignment of 
these similar peptides illustrates the power of high-energy CID in- 
protein primary structure determination. Methodology and frag- ' 
ment-ion nomenclature for high-energy CID are more thor- 
oughly described elsewhere (12, 23). Although their biological 
function in nonmuscle cells is not clear, the tropomyosins are 
multiple isoforms which complex with microfilaments (24). While 
spot 34 was readily predicted as cytoskeletal tropomyosin by 
MGWSE, fibroblast nonmuscle tropomyosin was the only iso- 
forrh among several ambiguous possibilities predicted for spot 33. 
Immunoblot analysis confirmed that tropomyosin is present in 
spots 33 and 34 and two more basic spots nearby. 

Nucleoside diphosphate kinase A (NDPK-A) was identified 
in spot 36 following high-energy CID. NDPK catalyzes the 
phosphorylation of nucleoside 5' diphosphates. NDPK-A is 
the product of the nm23 gene, which, when overexpressed, 
decreases tumorigenesis of melanoma cell lines (25). MOWSE 
results for this spot were inconclusive, due in part to chemical 
modification of two peptides. 

Cu/Zn superoxide dismutase (Cu/Zn SOD) and galaptin were 
identified in spots 35 and 41, respectively, following high-energy 
CID. Immunobiot analysis confirmed the .presence of both 
Cu/Zn SOD and galaptin. Cu/Zn SOD catalyzes the conversion 
of toxic superoxide to hydrogen peroxide. Gaiaptin, a lectin 



present in many tissues, has been found in several skin tumor 
types including melanoma, and its reduced expression. level has 
been suggested as a means of diagnosing malignancy (26). 
MOWSE was unable to predict the identity of both of these 

■ proteins from the obtained MALDI data. Only 15-25% of the 
MALDI masses obtained for spots 35 and 41 were consistent with 
unmodified tryptic peptides. Our studies show that for any given 
MOWSE search, numerous proteins present in the data base 
appear to be randomly capable of matching «*30% of the masses 

-in a given list. Below this level we have little confidence in 
MOWSE predictions and find sequence determination necessary 
for protein identification. • 

Triosephosphate isomerase (TPI), present in both spot 37 and 
spot 42, was readily predicted by MOWSE (Table 2). The success 
of MOWSE is noteworthy despite the fact that several of the 
MALDI masses are quite inaccurate (an error of more than ±3.5 
Da, attributed to an inconsistently performing laser power sup- 
ply). Subsequent high-energy CID and Edman degradation elu- 
cidated the sequences of several covalently .modified peptides and 
established the identity of TPI in both spots. Peptides that 
represented a nonspecific trypsin cleavage between His-95 and 
Ser-96 and chemical modifications, including acrylamide- 
modified cysteines, oxidation, and carbamylation, were found. 
The chemical modification of cysteine arose from protein inter- 
action with the gel matrix. Carbamylation is likely a consequence 
of using 2 M urea buffer during enzymatic digestion of spot 37. 
In-gel digestion with. 100 mM ammonium bicarbonate was done 
for spot 42 rather than electroeiution, thus eliminating the need 
for urea. 

TPI, which catalyzes the interconversion of dihydroxy ace- 
tone phosphate and glyceraldehyde 3 -phosphate in glycolysis, 
giuc oncogenesis, fatty acid synthesis, and the pentose shunt, is 




5076 Biochemistry: Clauser et at 
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800 , 900 



1000 



1100 



FiC. 3. High-energy CID spectrum of acetylated N-terminal tryp- 
tic peptide of nonmuscie tropomyosin (1-10 pmol) from spot 33, MH + 
= 1199.7 (monoisotopic mass). Sequence: CH3CO-AGLNSLEAVKR. 

. Peptide backbone cleavage ions associated with charge retention at the 
C terminus are denoted byx,y, and z; and at the N terminus by a and 

- b. Side-chain fragment ions are denoted by v and w. 

the product of a single gene^ However, multiple electro- 
phoretic forms have been observed, and some result from 
deamidations at Asn-71 and Asn-15 or oxidation of Cys-126 
(27). By high-energy CID, an Asn-71 containing peptide in 
spot 37 (mass 1620.9 in Table 2) was not. deamidated. The 
corresponding peptide was not found in spot 42. All five 
cysteines in TPI were shown by high-energy CID, Edman 
degradation, or mass matching to be consistent with acryl- 
• amide modification for spots 37 and 42. Throughout this work 
we have found cysteines exclusively in the acrylamide-modifled 
form, despite inclusion of the antioxidant sodium thioglycolate 
during electrophoresis. 
We have not currently pursued the strategy outlined in Fig. 
■ 2 which uses mass spectrometry to determine accurate masses 
on intact proteins isolated by 2D PAGE. For proteins in 
solution, mass determinations accurate to 0.01% (±2 Da.on a 
20-kDa protein) may be obtainable by MALDI or electrospray 
ionization mass spectrometry (28). Accurate intact masses 
could suggest the existence and possible identity of posttrans- 
lationai modifications or confirm an expected protein se- 
quence. However, mass determination on intact proteins iso- 
lated by 2D PAGE is confounded by problems associated with 
removal of stained protein from the gel matrix in a format 
amenable to mass spectrometry. Initial attempts at MALDI 
from proteins and peptides blotted onto polyvinylidene dif- 
luoride and nylon membranes are encouraging (29-31). 

Since our strategy is based on obtaining sequence informa- 
tion at fmol to low pmol levels, discovery of previously 
unknown peptide sequences will enable, design of oligonucle- 
otide probes for gene cloning. Furthermore, the growth of 
sequence data bases fueled by the Human Genome Project will 
increasingly reduce the chances of discovering unknown pro- 
teins. However, the discriminating power of a peptide mass- 
matching strategy employing ±2-3 Da MALDI mass data will 
progressively decline with data-base growth, thus producing 
increasing numbers of inconclusive results, necessitating se- 
quencing. Enhanced mass accuracy obtained by incorporating 
electrospray ionization mass spectrometry or improvements in 
MALDI technology would slow the anticipated decline in 
discriminating power of mass-matching strategies by reducing 
the effective portion of the data base considered in a search. 
Hence, when protein recognition is sought without need for 
primary structure characterization, the. identities of many 
known proteins may be rapidly predicted by combining mass 
matching with direct MALDI analysis of unfractionated di- 
gests (5, 6, 9, 10). Unfortunately, the ability to directly perform, 
substantial sequence analysis is severely compromised. Of the 
11 proteins we characterized from 10 gel spots, 6 were readily 
predicted by mass matching with MOWSE, and 9 contained 
unmatched covalently modified peptides. Consequently, our 



results illustrate the necessity of peptide sequencing and the 
advantages of tandem mass spectrometry to rapidly and un- 
ambiguously identify proteins isolated by 2D PAGE. 
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I, Erkki Ruoslahti, declare as follows: 



1) I am the Erkki Ruoslahti who is named as a co-inventor of the 
above-identified patent application. 



2) I understand that the claims of the subject application stand rejected, in 



part, on the basis that one skilled in the art allegedly would not have been able to identify 
homing molecules by in vivo panning with untagged libraries of molecules at the time the 
priority application was filed. 



Sir: 
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3) I believe that in 1995, at the time the priority application for the 
above-identified application was filed, an ordinary scientist using the teachings of the 
specification would have been able to use untagged libraries such as peptide and small molecule 
libraries in the claimed in vivo panning methods to recover and identify molecules that home to a 
selected organ or tissue. 

4) Corroboration of identification of homing molecules using an untagged 
small molecule library in accordance with the teachings of the patent specification is provided 
herein in paragraphs 5 to 8, which describe identification of a homing molecule from a library of 
10 small molecules injected into the circulation of mice. As described further below, mass 
spectrometric analysis of extracts from organs harvested 10 minutes following intravenous 
injection of the library showed that one compound homed selectively to brain. These results will 
shortly be published in the peer-reviewed journal of ChemBiochem. The galley proofs of this 
publication (Brown et al., ChemBiochem. 5:1-9 (2004)) are attached as Exhibit A. 

5) The library of 10 small molecules was selected from a much larger library 
containing pharmacologically active compounds. The structures of the 10 small molecules, 
which were selected by an individual who had no involvement with the subsequent in vivo 
panning experiments, and their molecular weights are shown in Table 1 . Each of the ten 
compounds was present at a final concentration of 1 mM in phosphate buffer (40 mM, pH 7.2). 

6) To identify homing molecules, two-month-old female BALB/c mice were 
anesthetized with avertin. Mice were injected intravenously in the tail- vein with 200 \i\ of 
library solution (200 nmol per compound). After ten minutes of circulation, organs were 
harvested and washed with phosphate-buffered saline (PBS). Each organ was mixed with 5 ml 
acetone and homogenized with a Handishear hand-held homogenizer (Virtis; Gardener, NY). 
Organ/acetone homogenates were transferred to 15 ml centrifuge tubes and incubated at -80°C 
for 12 hours to precipitate proteins. Following centrifugation at 3,000 x g for 30 minutes at 4°C, 
supernatants were recovered and dried in a SpeedVac. Organ extracts prepared from mice 
injected with PBS served as an internal control for the experiment. 
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7) Dried organ extracts were resuspended in 50 |il acetonitrile. The samples 
were then diluted 1 :2 in 20 mg/ml alpha-cyano-4-hydroxycinnamic acid in 50% acetonitrile and 
0.1% trifluoroacetic acid. Finally, one-half of the diluted sample was analyzed on an Applied 
Biosystems MALDI-TOF Voyager DE-Pro mass spectrometer at The Burnham Institute. 

8) As shown in Figure 1, a striking peak with m/z = 301 was present in the 
brain extract from library-injected mice but was absent from brain extract of control mice. Based 
on the molecular weight of 301 Da, the brain-homing compound could be identified from the 
other components of the injected library as compound "2C11" (see Table 1). This 301 Da 
compound is a pharmacologically active benzodiazepine, Oxazepam, known to cross the blood- 
brain barrier. Furthermore, as shown in Figure 1, a biologically inactive 265 Da benzodiazepine 
was not detected in the brain (left panel, arrow beneath X-axis). It is striking that we were able 
to make this observation in a blinded manner and identify the benzodiazepine, even though the 
MALDI-TOF mass spectrometer was not optimally accurate at the molecular weight range in 
this study. These results indicate that organ-homing molecules can be isolated from a chemical 
library by in vivo panning and subsequently identified using mass spectrometry. 

9) Further corroboration of the identification of homing molecules using an 
untagged small molecule library in accordance with the teachings of the patent specification is 
provided herein in paragraphs 10 to 20, which describe identification of homing molecules from 
a library of 75 random small molecules. As described in the paragraphs which follow, selected 
organs (kidney, liver, lung and brain) were harvested in organic solvent to precipitate proteins, 
and molecules from the library were subsequently identified in the soluble phase using 
electrospray mass spectrometry. 

10) In particular, 75 organic compounds were randomly selected from a 
420,000-member library from ChemBridge (San Diego, CA). There was high structural 
diversity among the 75 organic compounds, and the masses of the compounds differed from each 
other by at least 4 Da. The library was resuspended in dimethylsulfoxide (DMSO), with each 
individual compound at a final concentration of 1.33 mM. The 75 ChemBridge compounds and 



Inventors: Ruoslahti and Pasqualini 
Serial No.: 09/922,227 
Filed: August 2, 2001 
Page 4 

their masses are shown in Table 2. In vivo panning was performed as described above, except 
that mice were injected intravenously in the tail-vein with 25 of library (33 nmols per 
compound), and control mice were injected with a single compound known to lack homing 
activity. In some cases, 250 pmol to 2.5 nmol of control compound (ChemBridge 51 16670, 
molar mass of 340 Da) was added as a reference for quantification of the amount of homing 
molecule in a target organ. In addition, the organ extracts from mice injected with the 75- 
member library were not analyzed by MALDI-TOF mass spectrometry as described above but 
were prepared for subsequent analysis on a electrospray mass spectrometer. The dried organ 
extracts were resuspended in 100 [il of methanol, vortexed for about 10-20 minutes, and 
centrifiiged to pellet debris. The supernatants were recovered, further diluted 1 :20 in methanol, 
and 20 |il of the diluted sample was analyzed on a Waters® Micromass® LCT mass spectrometer 
(Milford, MA) at The Scripps Research Institute (La Jolla, CA). The mobile phase of the liquid 
chromatography was run with 90% methanol, 9% water and 1% acetonitrile. This form of mass 
spectrometry was more suitable than MALDI-TOF for analyzing compounds having a mass of 
300-600 Daltons in tissue extracts. 

1 1) The mass spectra of the tissue extracts from mice injected with the 
75-compound library were analyzed as follows. Candidate organ-homing molecules were 
represented as various peaks which appeared in organ extracts from library-injected mice but 
were absent from the corresponding organ extracts of control mice. High intensity peaks that 
appeared in both extracts from library-injected mice and control mice were excluded from the 
analysis; using this criteria, compounds represented by peaks at m/z = 437 and m/z = 453 were 
excluded as potential organ homing molecules. Other spectral peaks from tissue extracts of 
library-injected mice were also found in extracts of control-injected mice when the sensitivity of 
detection was increased by focusing the mass spectrometer on a narrower mass range. 
Extraneous peaks observed only in extracts from control mice, including the oscillatory signals 
observed in some analyses, also were excluded. In addition, compounds represented by peaks 
present at significant amounts in several different organs, such as compounds having m/z = 408 
and m/z = 424, were excluded as potential organ homing molecules due to a lack of specificity. 
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Peaks of interest were analyzed by comparison to the theoretical masses of the 75 individual 
ChemBridge compounds shown in Table 2. Figures 2A (kidney), 2B (liver), 2C (lung) and 2D 
(brain) show the results of initial screening experiments in which the library (upper panel) or 
control compound (lower panel) was injected into mice. Homing data from two different mice 
were generally analyzed. Where results between the two mice differed significantly, results from 
the second mouse are shown as an inset to the figure. 

12) Representative spectra from a kidney extract prepared from a 
library-injected mouse and from a control kidney extract from a mouse injected with a non- 
targeting compound are shown in Figure 2A. High intensity peaks present in kidney extract from 
a library-injected mouse had masses as follows: 298, 392, 408, 424, 432, 437, 453, 500 and 518 
(see Figure 2A). Of these, peaks at 437 and 453 were excluded as present in control extracts, 
and peaks at 408 and 424 were excluded as present in multiple different organ extracts. The 
remaining peaks (at 298, 392, 432, 500 and 518) were identified as representing potential 
kidney-homing compounds and were identified by comparison of m/z value of each peak to the 
theoretical mass of the 75 compounds in the starting library, as summarized in Table 3 below. 

13) When tested individually for their ability to home to selected organs, 
ChemBridge compounds 5862461 and 6074428 accumulated in kidney and did not localize to 
any other tissue (Figures 3 A and B, respectively). Compound 5343617 also localized to the 
kidney as shown in Figure 3C, although this compound accumulated preferentially in the liver 
and also was observed in the lung. Of the two remaining potential kidney-homing molecules, 
compound 51 16670 accumulated non-specifically when injected individually while compound 
5536652 was not tested. In sum, these results confirm that mass spectrometry can be used to 
identify organ-homing molecules such as kidney-homing molecules from an untagged small 
molecule library. 
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Table 3 




Potential kidney-homing compounds identified in library screening 


ChemB ridge 
compound 


Observed peak 


1 neoretical 
molecular weight 


Homing activity observed wnen 
molecule injected individually 


5343617 


298 


300.08 


Predominantly liver; also lung and 
kidney 


5116670 


392 


392.01 


Non-specific accumulation in all 
four organs 


5862461 


432 


432.07 


kidney 


6074428 


500 


500.01 


kidney 


5536652 


518 


520.01 


not tested individually 



14) Representative spectra from a liver extract prepared from a library-injected mouse 
(upper panel) and a control liver extract from a mouse injected with a non-targeting compound 
(lower panel) are shown in Figure 2B. In the liver extract from the library-injected mouse, nine 
high intensity peaks were observed (m/z = 298, 364, 392, 408, 424, 437, 453, 472 and 516). As 
before, peaks at 437 and 453 were excluded as present in control extracts, and peaks at 408 and 
424 were excluded as common to several organ extracts. As summarized in Table 4, the 
remaining five peaks, with m/z = 298, 364, 392, 472 and 516, were identified as representing 
potential liver-homing compounds, and were subsequently injected into animals individually for 
further characterization of homing activity. 

15) When tested individually for their ability to home to selected organs, 
ChemBridge compound 5343617 accumulated in liver, and to a lesser extent in lung and kidney, 
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corroborating that this molecule is a liver-homing molecule. The remaining molecules either 
accumulated non-specifically or were not detected when analyzed individually. 







Table 4 




Potential liver-homing compounds identified in library screening 


L-neniBriage 
compound 


unserved peaK 


i ueoreiicai 
molecular weight 


Homing activity ooservea wnen 
molecule injected individually 


5343617 


301/302 


300.08 


Predominantly liver; also lung 
and kidney 


5216419 


364 


364.03 


Not detected 


5116670 


392 


392.01 


Non-specific accumulation in all 
four organs 


6873050 


472 


472.00 


Non-specific accumulation in all 
four organs 


7567423 


516 


516.01 


Liver, kidney and brain 



16) The same library was analyzed by in vivo panning for identification of 
lung-homing compounds. Representative spectra from a lung extract prepared from a library- 
injected mouse (upper panel) and a control lung extract from a mouse injected with a known, 
non-homing compound (lower panel) are shown in Figure 2C. Approximately seven high 
intensity peaks were observed in the lung extract from the library-injected mouse, at m/z = 
301/302, 392, 408, 424, 437, 453 and 582. As before, peaks at 437 and 453 were discounted 
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since they appeared in control extracts, and peaks at 408 and 424 were discounted for their non- 
selective accumulation in different organ extracts. By comparison of the remaining three peaks 
(at m/z = 301/302, 392 and 582) to the theoretical masses of the components of the small 
molecule library, four compounds (ChemBridge 5343617, 51 16670, 7609370 and 5710134) 
were identified as representing potential lung-homing compounds (Table 5). Each of these 
molecules were subsequently injected individually into mice in order to corroborate lung homing 
activity. 

17) As discussed above, ChemBridge compound 5343617 localized, in part, to 
the lung. The remaining three compounds, 51 16670, 7609370 and 5710134, when injected 
individually, were observed to accumulate non-specifically or were not detected (see Table 5). 







Table 5 




Potential lung-homing compounds identified in library screening 


ChemBridge 
compound 


Observed peak 


Theoretical 
molecular weight 


Homing activity 


5343617 


298 


300.08 


Predominantly liver; also lung and 
kidney 


5116670 


392 


392.01 


Non-specific accumulation in all 
four organs 


7609370 


582 


580.68 


Not detected 


5710134 


582 


584.46 


Lung, liver and kidney 
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1 8) The results of in vivo panning for identification of brain-homing molecules 
are shown in Figure 2D, which represents spectra from a brain extract prepared from a library- 
injected mouse (upper panel) and a control brain extract from a mouse injected with a control, 
non-targeting compound (lower panel). As shown in Figure 2D, no brain homing molecules 
were identified in the library of 75 random small molecules which was screened. The only high 
intensity peaks present in the brain extract from the library-injected mouse were also present in 
the mouse injected with control compound. 

19) Representative individual homing experiments are shown in Figures 3 A, 
3B and 3C. Figure 3A shows detection of compound 5862461 (m/z = 429, 431 and 433) in a 
kidney extract after intravenous injection. Figure 3B shows that compound 6074428 (m/z = 499) 
localizes to the kidney following intravenous injection. Figure 3C shows that compound 
5343617 (m/z = doublet at 298 and 300) homes to liver and lung following intravenous injection. 
These results, and the structures of the various ChemBridge molecules, are summarized in 
Figure 4. 

20) Several of the organ-homing molecules contained one or more bromine 
atoms. Bromine exists in nature as two approximately equally abundant isotopes differing in 
mass by two daltons. When organ-homing molecules were assayed individually, molecules 
containing bromine exhibited characteristic split spectral peaks. In particular, ChemBridge 
molecule 5343617, which contains a single bromine atom, displayed a characteristic doublet at 
m/z = 298 and 300 (see Figure 3C). Similarly, ChemBridge molecule 5862461 which has two 
bromine atoms, displayed characteristic spectral peaks in the ratio of approximately 1:2:1 at m/z 
= 429, 431 and 433 (Figure 3 A, inset). These characteristic spectral peaks corroborate 
identification of the bromine-containing ChemBridge molecules. 
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21) In sum, these results demonstrate that one of skill in the art would have 
been able to identify homing molecules following in vivo panning of untagged small molecule 
libraries using techniques such as mass spectrometry, which were routine in the art at the time 
the priority application was filed. 

I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further that 
these statements are made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that any such willful false statement may jeopardize the validity of the 
application or any patent issued thereon. 



Date: 




Exhibit A 



Drug Identification through in vivo 
Screening of Chemical Libraries 

Darren M. Brown, Maurizio Pellecchia, and 
Erkki Ruoslahti* [a] 

Specific cell-surface molecules can direct leukocytes and cer- 
tain tumor cells to particular organs. 11 " 31 Recent work by our 
group has shown that peptides, selected by using the in vivo 
phage-screening approach, are also capable of mediating se- 
lective in vivo localization of phages to individual organs as 
well as tumorsJ 4 * 61 To develop our targeting technology 
beyond peptide-based systems, we investigated the feasibility 
of screening a chemical library to identify small molecules 
other than peptides that possess a preferential affinity for par- 
ticular organs or tissues. As a proof of principle, we screened 
two different chemical libraries in vivo and identified three 
compounds that preferentially accumulated in individual 
organs: a pharmacologically active benzodiazepine localized in 
the brain, another compound specifically homed in on the 
liver, and the third on the kidneys. These results show that it is 
possible to use in vivo chemical-library screening to identify 
compounds that distribute themselves to specific sites in the/'* 
body. Such knowledge can focus drug discovery on com- 
pounds with promising pharmacokinetic and tissue specificity, 
profiles. 

For in vivo screening of chemical libraries, we injected mix- 
tures of small molecules into the circulation of mice;' harvested^ 
selected organs in organic solvent to precipitate proteip;|ahd 
detected the presence of compounds from the librai^ini|he 
soluble phase by mass spectrometry. We jnitiajly tested a 
brary of ten compounds. Mass-spectrometric^analysis offex- 
tracts from organs harvested 10 min afte%the intravenous^in- 
jection of the library showed that one com ^oun^ preferment ia I ly 
accumulated in the brain (Figure Breaking the code for 
the compounds revealed thiOO|^a\om pound Hofbe a ben- ■ 
zodiazepine known as Oxazepam;!!} A biologically inactive 
265 Da benzodiazepine also present iruthe library was not de-( 
tected in the brain (FigurejA)>Thu£ it appeared possible to 




obtain organ-targeting ^small moleculesrby screening chemical 



libraries in vivo'/ We alscnlearned from these early studies^that^ 
it was easier tot detect library^ compounds and differentiate 

♦u ; f ^ m ^ * i i • * * «. 

them fromiiendoqenous^tissue molecules in organic extracts 
analyzed by, electros pray^mass spectrometry when the library 
molecules had molar£masses|greater than 300 Da. 

To test a la rger^i brary, We^assembled a mixture of 75 com- 
pounds with molaif^masses between 300 and 600 Da and 
screened for compounds that home in on the brain, liver, 
lungs, or kidneys. Mass spectrometry performed on organ ex- 
tracts from library-injected mice identified ten molecules as 
candidate organ-homing compounds. These ten compounds 



[a] Dr. D. M. Brown, Prof. M. Petlecchia, Prof. E. Ruostahti 
Cancer Research Center, The Burn ham Institute 
10901 North Torrey Pines Road, La Jolta, CA 92037 (USA) 
Fax: (+1)858-646-3198 
E-mail: ruoslahti@burnham.org 
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Brain (Library) 



301.0 



yi i M i lpUn 

260| 280 



240 



300 320 



Brain (Control) 



240 260 280 300 320 
m/z ► 



B Kidney 




Control Kidney 
421 



422 





400 



425 
m/z - 



465 



450 
► 



■'IT*' 

475 



were tested individually for their ability to specifically, target in- 
dividual organs. Compounds 5862461|Vand 6074^8 nwere 
found to accumulate in the kidneys, (Figure IB and'Qf The 
other tissues tested negative forjthese-two compounds. Com- 
pound 5343617 was found |3ri%^rily\jn^the liverPand, to a 
lesser extent, the lungs andilciSneys (gJ^ure 1 D). The spectral 




patterns of compounds 5862461 and^p43617^were particularly^ 
distinct because these^ f co^pbu^sfc<>ntain bromine/whicrk 




exists as two equaHvTabundant natural%6topes, l8] and causes a V 
characteristic^tvv^-mass^uni^split in the spectral peak (Fig- 
ure 1 B, inse^.^Ghe^ompound^accumulated in the lungs, kid- 
neys, and liver, but^hot-theUbrain; and another localized to the 
brain, kidneys^and liver^but not? the lungs (data not shown). 
These compou|S?a'reilike^to/bind to receptors that are ex- 
pressed in more^than one^tislfue, but the varying tissue selec- 
tivity of these compbtinds clearly indicates tissue-specific 
homing. Extracts from the organs of control-injected mice con- 
firmed that no molecules matched the spectral pattern of the 
homing compounds. Two other candidate organ-homing com- 
pounds localized to all four tested organs. These compounds 
might bind to molecules present in all tissues, but it is also 
possible that their concentration in blood remaining in tissues 
is high enough to allow detection. As these compounds did 
not show any tissue-specific homing, we did not study them 
further. For three compounds, the specific organ homing could 



© 2004 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.chembiochem.org ChemBioChem 2004, 5, 1-9 



Dateiname: Z300824 Pagtna: 2. 

Seite: 21* von a Umfang (Selten): 9 

Datum: 15 KW.. 8. April 2004 (Donner stag} Sprachs American 

Zeit: 14:33:05 Uhr 



Kidney 



514 



499 



498 



480 500 520 
mli - 



514 



498 



480 500 520 
— ► 



Lung 



489 



480 



514 



500 



480 500 520 
m/z 



480 500 520 



Liver 



Brain 



514 



+ 




498 j 








(J 





514 



480 500 520 480 500 520 
m/z ► 



Kidney 



302 
298 [303 

290 300 310 
m/z — 



303 



297. 



290 300 310 



298' 



Liver 



300 



302 



290 300 



310 
m/z - 



'480 ' 566 520 " 480 500 520 



m/z 



A 

. NX 

.'X N-- 

fry 

>N ' 




44 



290 



290f ;300. 



31 0* 



-4: 



F/gure 7. /n vivo targeting* 6f small molecules to particular organs. A) Detection of the benzodia- 
zepine, Oxazepam,fm the bram\Wwjn after intravenous injection with^alten^cprhpound library. 
"Control" mice wereJnjected with^vehicle^ alone. The downward pointing arrow denotes the spec- 
tral peak for Oxazepam. The arrow below^the axis denotes the m/z of the biologically inactive 
benzodiazepine in thefiibfali0N6Tpeak is seen at this position. B-D) Mice were intravenously in- 
jected with indiyidualcompoundsJrom the 75-member library, and tissues were analyzed 10 min 
later by mass spepr6m"elr^^^Detectionipf compound 5862461 in the kidneys after intravenous 
injection and circulation for 10 min? 'Control" denotes mice injected just with DMSO. C) Com- 
pound 6074428 target£the.kia£eys'MWor lungs ?■ D) Compound 5343617 targets the liver 
and lungs. Compound peak heights are shown as relative signal intensity (\). A "+" denotes 
compound-injected mice and "—" denotes DM50~injected mice. The downward pointing arrows 
mark the spectral peaks for the organ-homing compounds. 
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not be confirmed in individual testing. The remaining 68 com- 
pounds were not detected in any tissue, apparently because 
they did not sufficiently accumulate in any of the test tissues 
to bring the concentration above the detection limit. 

We next quantified the organ accumulation of the three 
compounds with the most promising organ-homing proper- 
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CHEMBIO 




ties. We used the mass spectrometer to compare the relative 
amounts of targeting compound in extracts of different 
organs. Compound 6074428 was at least 30-fold more concen- 
trated in the lungs BHor kidneys?>> than in the liver, kid- 
neys, and brain (Table!). At least 2.4 times more compound 



Table 1. Homing specificity of compounds and their accumulation in target 
organs. The structure, target organ, and homing activity of the three organ- 
homing compounds are shown. The quantity of homing compound in the 
target organs Wmin after an intravenous injection of individual com- 
pounds was determined as described in the Experimental Sectoin. The accu- 
mulation of targeting compound was expressed as normalized signal inten- 
sity level relative to the detection limit 



Organ-targeting 
Compound 



Target Signal Intensity 1 * 1 
Organ(s) (fold higher than 
detection limit) 




kidney 2.4-fold (±0.6) 



kidney 3 2-fold (± SAW*** ^ 



ChemBridge 6074428 
O 




■ Blunge | 

liver ( 67-fold (± s 2Tl)f^ 
lung xrU- 2 - fold (±0^6)4 




ChemBridge 534361 7 



/^r-}'% kidney 1.2-fold (±0.3) 



[a] Accumulation data is represented as meanl percentage (± stand ardjde- 
viation) for two experiments per variable. 




5862461 localized to A the;kidneys than to the liver, Iungs7§nd 
brain. Compound 1534361 7 accumulated very strongly in the 
liver; about 55-fold higher levels were detected in the liver 
than in the^kidneys, which contained a trace amount of the 
compound. This compound /V was also present at moderate 
levels in the lungs, but was not detectable in the brain. As 
each of these three compounds accumulated in different tis- 
sues BBsee aboveBH, their organ-selective homing is clear- 
ly specific and not due to the presence of blood or unspecific 
trapping in the target organs. 

We then measured two parameters that influence the sensi- 
tivity of in vivo chemical-library screening. First, we used the 
mass spectrometer to analyze the spectral intensity of nine dif- 
ferent compounds added to organ extracts, and found that 
the smallest amount of an individual compound that could be 
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detected in a tissue extract was between 34 and 215 pmol. For 
the second parameter, we determined the smallest amount of 
homing compound that could be injected and still detected in 
our in vivo screening system. For this analysis, the signal inten- 
sity of compound 6074428 in kidney extracts from mice inject- 
ed with various amounts (2 to 125 nmol) of the compound 
was determined by mass spectrometry. The spectral peak at 
m/z 499 from compound 6074428 was detectable in kidney 
extracts from mice injected with as little as 7.8 nmol of target- 
ing compound (Figure 2). In the initial library screen with 75 
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Figure 2. The lower detection limit of in vivo chemical-library screeningTMass-spectrometric analysis of kidney extracts 
from mice injected with either A) 125 nmol or B) 7.8 nmol^yf compound 607^J28.£The peak intensitieywerefnormalized 
to the height of an endogenous tissue molecule at n^r421^that was consis&ntjy^a'etected in kidney extracts ^circled). 
The asterisk denotes the spectral peak for the kidneyMM?MM : ho/ning compound, 6074428. The spectral peak inten- 



sities of other endogenous tissue molecules (e.g. thelmolecules i dWto\te9?ana' 481) varied between 'experiments; as a 



result, they were not used to normalize the spectralipjept^intensitiesid^corhpound 6074428. Compound peak heights 
are shown as relative signal intensity (\). /|f(^ ^l^^s^ \- 




compounds, about 33 nmo^%eJc|^6lecule was present in 
the injected library mix. Tnere ^ r ^^'^^^'y tnat 300 com l 
pounds could be teste^j^Bingle ^creening^round for organr ^ 
targeting compounds. Given tn^SSa^and simplicitylof thifte 
screening technique, ajibrary of 10000 compounds could bej 
screened in ^vivo\in a few weeks with a relatively small-stale 
effort. f^4,A "\ 

We encounteredToT^^limitatibns with in vivo chemical-li- 
brary screen ing^thatJvil I Be, addressed in future studies. The 
volume of library injected ^hto^the mice (25 ui_) was limited by 
the toxicity of the^solyentfclimethyl sulfoxide (DMSO), With a 
less toxic solvent, it should be possible to inject up to 200 ul. 
of library and screen potentially as many as 3600 compounds 
in one round. Emulsifying agents like Cremophor* EL, Emul- 
phor* polysorbate 80, Solutol* HS15, or solvents containing N- 
methylpyrrolidone could be used as an alternative to DMSO 
when solubilizing the chemical library before in vivo screening. 
In addition, only 1 % of the organ extract could be analyzed by 
mass spectrometry due to the presence of various endogenous 
tissue compounds in the acetone extracts. A more selective ex- 
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traction and prepurifkation method could increase the sensi- 
tivity of the compound detection by mass spectrometry. 

The biological basis for the targeting activity of some of the 
compounds identified in the screen has yet to be determined. 
However, it seems likely that binding to benzodiazepine recep- 
tors mediated the brain-homing activity of the pharmacologi- 
cally active benzodiazepine, as the related inactive compound 
did not accumulate in the brain. The kidney-homing com- 
pound, 6074428, contains a benzenesulfonamide group that is 
known to have diuretic properties; perhaps this group medi- 
ates the kidney-homing activity of this compound. 

This work provides the first demonstration that it is possible 
to conduct large-scale screening of chemical libraries in vivo. 
Such screening can identify targeted small molecules for use in 
a variety of applications and has some advantages over previ- 
ous methods. In vivo phage screening primarily targets the 
vascular endothelium. Low-molecular-weight chemical com- 
pounds can target the vasculature, but are also likely to gain 
access to parenchymal cells in tissues. That parenchymal cells 
can be targets is suggested by our recovery of a benzodiaze- 
pine as a brain-homing molecule, as most receptors for these 
compounds are on the neurons. As an additional advantage, 
this screening approach does not require encoded or tagged 
library compounds. This is an improvement over other ap-^ 
proaches that require separate chemistries for coupling differ- 
ent small molecules to synthetic or genetically engineered<tags^ 
such as bacteriophages. 191 In addition, the absence okom r ' 
pound tags eliminates the possibility of interference by, the tag 
with the in vivo homing activity. >^> 
The localization of selective molecules to specific "addresses'' 
on the endothelium suggests that each tissue puts a special- . 
ized signature on its vasculature. 1101 Orgafefpecific vascular 
molecules are attractive targets for the jtel i veiy W 
to particular sites. By conjugating targetinJ t ^oieties|toid^gs, 
diseases such as cancer can be treated wilmf increased efficacy 
and fewer side effects; 111121 phage-tejved homing^ peptides 
and peptidomimetics have beenfused injthis manner to target ^\ 
malignant tumors. 1 "" 151 Organ^^^iir^^^mpounds isolated c ^ 



from chemical libraries an 
poses. 



In vivo screeninqf^may^also iderWify%mall molecules that ; 
have pharmacological effectslat the target organ. The identifi- 
cation of a ; neuroactive compound and a potential diuretic as 
brain- and| kidneyrhqhiihgj^molecules, respectively, suggests 
that this may.be^possiblel^rhus^in vivo screening has the po- 
tential to advance drug discovery; it allows pharmacokinetics 
and specificity of%action tq^be studied among large numbers 
of candidate compounds, or even from completely random li- 
braries. Such approaches may accelerate the discovery and de- 
velopment of new drugs. 




Experimental Section 

A library of ten small molecules with molecular weights between 
200 and 300 Da was prepared by a person not involved in the in 
vivo experimentation and was tested blindly. The ten-compound li- 
brary was prepared in phosphate buffer (40 rriM, pH 7.2) with each 
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molecule at a final concentration of 1 mM. A larger library of small 
molecules was prepared from 75 organic molecules (purchased 
from ChemBridge, San Diego, CA) with molecular weights between 
300 and 600 Da. The library compounds were randomly selected 
from a 420000-member ChemBridge library, with each compound 
satisfying the following criteria: 1)the partition coefficient, ex- 
pressed numerically as \ogP, was less than 5 and 2) the molecular 
weights of the compounds differed from each other by at least 
4 Da. There was high structural diversity in the library, given that 
the only limitation was the compounds selected from the 420000- 
member parent library needed to fit the parameters described 
above. The 75-compound library was resuspended in DMSO, with 
each molecule at a final concentration of 133 mM. 

To identify molecules that localize to particular organs, two-month- 
old female Balb/c mice were anesthetized with avertin (0.15 jxL g" 1 ) 
administered intraperitoneal!/ In experiments with the ten-com- 
pound library, 200 ul of library solution (200 nmol per compound) 
was intravenously injected into the tail vein. With the 75-com- 
pound library, 25 jxL of library solution (33 nmol per compound) 
was intravenously injected into the tail-vein. After 10 min of circu- 
lation, the lungs, liver, kidneys, and brain were removed. We found 
5-15 min to be optimal for the screening of intravenously injected 
phage for homing to individual tissues and tumors/ 161 and we 
wanted to keep the time short enough to prevent metabolism of 
the injected compounds, which would change their mass-spectro- 
metric signature. 

The organs were washed with PBS (5 mL) to remove excess blood 
and weighed. Each organ was mixed with acetone (5 ml) andftfen; 
homogenized with a Handishear hand-held homogenizerj (Virtis, 
Gardiner, NY). For certain organ homogenates, a control xonri I pound 
(ChemBridge 5116670, molar mass 340 Da, 0.25-2.5 nmol) .was-- 
added as a reference to quantify the amount of homing com- 
pound in target organs. The organ/acetone homogenates^were 
transferred to 15 mL centrifuge tubes and incubated at -80°C s f6r:|i^ 
12 h to precipitate the proteins. Following cenfrifugation |for 
30 min at 3000 g and 4°C, the supernatants, were recovered^ ancl 
dried in a Speed Vac. A set of control organ;: extracts wasfalsolpre- 
pared from mice that were injected witfcpure % DMSO,(25 \xL) f 

The dried organ extracts were resuspended in methanol (100 uL), 
spun in a vortex for about lOJgOJriin, anc%separated in a centri- 




fuge to give pellet debris ■ |6k?B KThelsljpernatants were re- f #% 
covered, further diluted 1 ijpjm^methanol^ diluted sample? / 



(20 llL) was analyzed on rWa^l^lcrbmass" LCT mass spectromek 
ter (Milford, MA) ^at^ThT^Sajpps Researchilnstitute (La Jp|a, CA). 
The mobile Pp*& of tr^liqiricl chromatography was rur^with** 
methanol/wate^a^etQnitrile^90:9:1). By comparing the masses of 
the individual comS^ds.and the^molecules in the organ extracts 
of mice injected with DMSp to the .molecules in the organ extracts 
from the mice^injectedSwitlftthe library, we were able to identify 
molecules in the^library thatllocalized to a particular organ. 

The accumulation drcompounds in organs was measured as fol- 
lows: 

We first measured the signal intensities of the targeting com- 
pounds using the mass spectrometer and compared them to the 
signal intensity of a standard compound that was added to the 
organ extracts; this enabled us to normalize the intensity value of 
compound peaks from experiments performed on different days. 
We then determined the smallest amount of an individual com- 
pound that could be detected in a tissue extract using mass spec- 
trometry by measuring the spectral intensity of nine different com- 
pounds added in small amounts to organ extracts. The detection 
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limit was defined as the spectral intensity level halfway between 
the background noise and the spectral intensity level generated 
from the smallest detectable amount of compound in organ ex- 
tracts (averaged from nine different compounds whose spectra 
were analyzed and displayed with a scanning window of m/z 290- 
610). The normalized intensity values for homing compounds were 
compared to the detection limit to determine the degree of en- 
richment of compound in target organs relative to background 
levels. These enrichment values were not comparable from com- 
pound to compound, since each compound has a different ioniza- 
tion efficiency and stability on the mass spectrometer. 

The Burnham Institute Animal Research Committee approved the 
animal experimentation in compliance wiht the relevent US laws. 
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COMMUNICATIONS 




D, M. Brown, M. Pellecchia, 

E. Ruoslahti* 



Drug Identification through 
in vivo Screening of 
Chemical Libraries 



Drug discovery process stood on its head? A proof-of-principle study has shown that large ,N V , 
numbers of chemical compounds can be tested for selective accumulation in individual tissues "x^. ^ 
in live mice. The picture shows how the site of accumulation of the tail-vein-injected molecules |\ ;/ 
could be determined by mass spectrometry. The process picks drug candidates} baseckpn tissue/ / 
specificity and pharmacokinetics rather than defined activity, which is the usual starting point of,/ 
drug discovery. 
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TABLE 1 


COMPOUND 


STRUCTURE 


MOLECULAR WEIGHT 


1E4 


CA.A.XJ 

H H 


226.2774 


1B5 


ho °* ; W'-' , * t 


273.349 


1B6 




289.3052 


1D6 




240.3012 


1B12 


•O 


240.2148 


2A6 




264.3262 


2C11 




300.7439 


2D8 


OH 


243.2646 


1B1 




240.261 


2E3 




225.2464 



Table 2 



ChemBridge 
Compound 
number 


Mass (g) 


5343617 


300.08 


5135609 


304.08 


6098554 


^x ^x 

308.09 


7246197 


312.07 


5904415 


316.09 


5108221 


320.05 


6955991 


324 


7477972 


328.1 


5225540 


332.04 


7253800 


336.04 


^X J ^X ^X 

5231936 


340.09 


7383619 


344.09 


5403771 


348.03 


5279582 


352.12 


5377438 


356.08 


5550053 


360 


5216419 


364.03 


5276832 


368.12 


5155350 


372.07 


5809106 


376 


7257635 


380.01 


5225132 


384.02 


5380863 


388.06 


5116670 


392.01 


5624827 


396.04 



ChemBridge 
Compound 
number 


Mass (g) 


5578637 


404.02 


5217141 


408.11 


5300003 


412 


#** ^x ^x *x ^ ^x ^x 

5326482 


416.04 


5246030 


420.05 


6090295 


424.09 


7384366 


428.06 


5862461 


432.07 


5364112 


436.03 


7100798 


440.07 


5569100 


444.01 


6903967 


448.1 


6170510 


452.01 


5169028 


456.08 


5214985 


460.01 


5216127 


464 


5255244 


468.02 


6873050 


472 


6987235 


476.01 


6872990 


480 


6875321 


484.01 


5130527 


488.05 


6987469 


492.01 


5348584 


496 


6074428 


500.01 



ChemBridge 
Compound 
number 


Mass (g) 


7583971 


508.01 


5768124 


512 


7567423 


516.01 


5536652 


520.01 


5717564 


524.01 


7497180 


528.07 


5671388 


532 


5670039 


536.14 


5555479 


540.06 


7575548 


544.06 


7591015 


548.01 


5374146 


552.02 


6394103 


556.03 


5557349 


560.02 


5551154 


564.06 


5711954 


568.06 


5101382 


572.19 


5227898 


576.56 


7609370 


580.68 


5710134 


584.46 


5751093 


588.52 


6968226 


592.27 


5743815 


596.61 


5233904 


600.53 



